Uploaded image for project: 'Pentaho Data Integration - Kettle'
  1. Pentaho Data Integration - Kettle
  2. PDI-16527

XML Input Stream step re-writes data in fields from previous steps

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Closed
    • Severity: Urgent
    • Resolution: Fixed
    • Affects Version/s: 7.1.0 GA, Master
    • Fix Version/s: 8.1.0 GA
    • Component/s: Step
    • Labels:
    • Story Points:
      0
    • PDI Sub-component:
    • Notice:
      When an issue is open, the "Fix Version/s" field conveys a target, not necessarily a commitment. When an issue is closed, the "Fix Version/s" field conveys the version that the issue was fixed in.
    • Sprint Team:
      Tatooine (Maint)
    • Steps to Reproduce:
      Hide
      • Download attached zip file, unpack it and open XML_Input_stream.ktr
      • Execute it
      • Select "List of XMLs" step and click on "Preview Data" tab -> it has 13 fields (filename, short_filename, path, type etc.)
      • Click on "Remove extra columns" step -> it has only 3 fields (filename, short_filename, uri). It's expected because the rest were removed by the step.
      • Click on "XML Input Stream" step -> it has 8 fields (3 from the previous step and 5 were added by this step
      • Look to the fields content
        Actual Result: the content of first three fields (filename, short_filename, uri) was rewritten by XML specific information, last three XML specific fields has no data.
        Expected Result: the content of first three fields should remain (filename, short_filename, uri). New information should be added to XML specific fields
      Show
      Download attached zip file, unpack it and open XML_Input_stream.ktr Execute it Select "List of XMLs" step and click on "Preview Data" tab -> it has 13 fields (filename, short_filename, path, type etc.) Click on "Remove extra columns" step -> it has only 3 fields (filename, short_filename, uri). It's expected because the rest were removed by the step. Click on "XML Input Stream" step -> it has 8 fields (3 from the previous step and 5 were added by this step Look to the fields content Actual Result: the content of first three fields (filename, short_filename, uri) was rewritten by XML specific information, last three XML specific fields has no data. Expected Result: the content of first three fields should remain (filename, short_filename, uri). New information should be added to XML specific fields

      Description

      "XML Input Stream (StAX)" step re-writes information in fields from previous steps.
      Due to that issue it doesn't make sense to use predecessor steps for "XML Input Stream (StAX)", the data will be be lost

        Attachments

          Activity

            People

            Assignee:
            pavel_hrakovich Pavel Hrakovich (Inactive)
            Reporter:
            pavel_hrakovich Pavel Hrakovich (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

              Dates

              Created:
              Updated:
              Resolved: