Uploaded image for project: 'Pentaho Data Integration - Kettle'
  1. Pentaho Data Integration - Kettle
  2. PDI-11184

XML Output for UTF-8 Encoding outputs as ANSI

    XMLWordPrintable

    Details

    • PDI Sub-component:
    • Notice:
      When an issue is open, the "Fix Version/s" field conveys a target, not necessarily a commitment. When an issue is closed, the "Fix Version/s" field conveys the version that the issue was fixed in.
    • Operating System/s:
      Windows 7 (64-bit)

      Description

      Between version 4.4 and version 5.0 XML Output changed how it handled UTF-8 encoded characters. In 4.4, the XML file output was ANSI as UTF-8 and displayed the UTF-8 characters. In 5.0.1, the XML file out is ANSI and displays all encoded code points for the characters.

      Most XML readers will read this, but I am attempting to create a complex structure, and the unnecessary code points that are encoded make it more difficult to fix the complex structure.

        Attachments

        1. file_4_4.xml
          123 kB
        2. file_5_0_1.xml
          143 kB
        3. PDI-11184-XML-UTF8.ktr
          14 kB

          Activity

            People

            Assignee:
            Unassigned Unassigned
            Reporter:
            dsplats Daniel
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Dates

              Created:
              Updated:
              Resolved: