Uploaded image for project: 'Pentaho Data Integration - Kettle'
  1. Pentaho Data Integration - Kettle
  2. PDI-2719

Access Input step destroys/mangles contents of binary columns (e.g. "OLE Object" type columns)

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Closed
    • Severity: Unknown
    • Resolution: Fixed
    • Affects Version/s: 3.2.2
    • Fix Version/s: 3.2.3, 4.0.0 GA
    • Component/s: Step
    • Labels:
      None
    • Notice:
      When an issue is open, the "Fix Version/s" field conveys a target, not necessarily a commitment. When an issue is closed, the "Fix Version/s" field conveys the version that the issue was fixed in.
    • Operating System/s:
      Windows XP

      Description

      MS-Access databases can store binary data in columns of type "OLE Object". But the Access Input step turns this data into strings which look like "[B@1f8320".

      Looking at the code for the org.pentaho.di.trans.steps.accessinput.AccessInput class, it seems that conversion of the values returned by the Jackcess library is performed by converting to a string, then to a PDI type. But binary data is returned as byte[], and when converted to a string the contents is lost. However, dates are already treated specially (they are used directly without conversion), and the solution to this bug is to treat byte[] references in a similar way.

      A patch is attached.

        Attachments

          Activity

            People

            Assignee:
            gdavid Golda Thomas
            Reporter:
            todd Todd Owen
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

              Dates

              Created:
              Updated:
              Resolved: