Uploaded image for project: 'Pentaho Data Integration - Kettle'
  1. Pentaho Data Integration - Kettle
  2. PDI-2719

Access Input step destroys/mangles contents of binary columns (e.g. "OLE Object" type columns)

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Closed
    • Severity: Unknown
    • Resolution: Fixed
    • Affects Version/s: 3.2.2
    • Fix Version/s: 3.2.3, 4.0.0 GA
    • Component/s: Step
    • Labels:
      None
    • Notice:
      When an issue is open, the "Fix Version/s" field conveys a target, not necessarily a commitment. When an issue is closed, the "Fix Version/s" field conveys the version that the issue was fixed in.
    • Operating System/s:
      Windows XP

      Description

      MS-Access databases can store binary data in columns of type "OLE Object". But the Access Input step turns this data into strings which look like "[B@1f8320".

      Looking at the code for the org.pentaho.di.trans.steps.accessinput.AccessInput class, it seems that conversion of the values returned by the Jackcess library is performed by converting to a string, then to a PDI type. But binary data is returned as byte[], and when converted to a string the contents is lost. However, dates are already treated specially (they are used directly without conversion), and the solution to this bug is to treat byte[] references in a similar way.

      A patch is attached.

        Attachments

          Activity

            People

            • Assignee:
              gdavid Golda Thomas
              Reporter:
              todd Todd Owen
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: