Uploaded image for project: 'Pentaho Data Integration - Kettle'
  1. Pentaho Data Integration - Kettle
  2. PDI-18295

Error org.xml.sax.SAXParseException



    • Type: Bug
    • Status: Closed
    • Severity: Unknown
    • Resolution: Not a Bug
    • Affects Version/s: 8.3.0 GA
    • Fix Version/s: None
    • Component/s: Job, Transformation
    • Labels:
    • Story Points:
    • Notice:
      When an issue is open, the "Fix Version/s" field conveys a target, not necessarily a commitment. When an issue is closed, the "Fix Version/s" field conveys the version that the issue was fixed in.
    • Operating System/s:
      Windows 10


      Good morning,

      For several months I am having a problem executing a job in Pentaho Data Integration.

      When I execute the job, there are times when I get the following error:

      2019/08/16 09:34:32 - QLIK_to_ORIGIN_GI_31_fotos_ALARM_PPTO_VUELO_TEST - Starting entry [XSL_Qlik_Tera]
      2019/08/16 09:34:32 - XSL_Qlik_Tera - ERROR (version, build from 2019-06-11 11.09.08 by buildguy) : Error occurred while processing the XSL transformation
      XML filename [/var/aea/Datateam/QLIK_to_ORIGIN/GESTION_INGRESOS/Alarma/PPTO_VUELO_GI_31_fotos_QLIK_TERA.xml]
      XSL filename [/var/aea/Datateam/QLIK_to_ORIGIN/GESTION_INGRESOS/Alarma/PPTO_VUELO_GI_31_fotos_QLIK_TERA.xsl]
      Error : org.xml.sax.SAXParseException; lineNumber: 4; columnNumber: 1; Content is not allowed in trailing section.

      I have investigated the reason and it is because an .xml file that I am filling in with an xsl transformation is not generated correctly.

      Looking online I have seen that there may be blank spaces in the generated .xml file, but I have reviewed it several times and it is correct.

      It could also be due to the encoding used to generate the .xml, but I have already seen that it is always configured as UTF-8 (correct).

      Another possible cause is that in the XSL transformation step the Transformer factory was not properly configured in the Settings, but JAXP is always set (also correct).

      The xsl files I use are filled in this way:

      <?xml version='1.0' encoding='UTF-8'?><?xml version='1.0' encoding='UTF-8'?><xsl:stylesheet version='1.0' xmlns:xsl="http://www.w3.org/1999/XSL/Transform" xmlns="http://www.w3.org/1999/xhtml"><xsl:output method="xml" indent="yes" encoding="UTF-8"/><xsl:template match="/Results"><html><body><table border="1"> <tr> <th>Count_LINK_Qlik</th> <th>Count_LINK_Tera</th> <th>Ratio_Gestion_Ingresos_Tera</th> <th>margen_error_qlik_tera</th> <th>margen_warning_qlik_tera</th> </tr> <xsl:for-each select="Result"> <tr> <td><xsl:value-of select="Count_LINK_Qlik"/></td> <td><xsl:value-of select="Count_LINK_Tera"/></td> <td><xsl:value-of select="Ratio_Gestion_Ingresos_Tera"/></td> <td><xsl:value-of select="margen_error_qlik_tera"/></td> <td><xsl:value-of select="margen_warning_qlik_tera"/></td> </tr> </xsl:for-each> </table></body></html></xsl:template></xsl:stylesheet>

      And .xml files i use are filled in this way:

      <?xml version='1.0' encoding='UTF-8'?>
      <Result><Count_PPTO_VUELO_Qlik>733</Count_PPTO_VUELO_Qlik> <Sum_REVENUE_Qlik> 0001168445654,41</Sum_REVENUE_Qlik> <Sum_PAX_Qlik>7001910,0</Sum_PAX_Qlik> <Sum_CAPACITY_Qlik>8808105,0</Sum_CAPACITY_Qlik> <margen_error_qlik_ora>1,0</margen_error_qlik_ora> <margen_warning_qlik_ora>0,5</margen_warning_qlik_ora> <Count_PPTO_VUELO_Ora>592</Count_PPTO_VUELO_Ora> <Sum_PAX_Ora>5699643</Sum_PAX_Ora> <Sum_CAPACITY_Ora>7205412</Sum_CAPACITY_Ora> <Sum_REVENUE_Ora> 0000938892713,98</Sum_REVENUE_Ora> <Ratio_PPTO_VUELO_Qlik>23,8175675676</Ratio_PPTO_VUELO_Qlik> <Ratio_PPTO_VUELO_PAX_Qlik>22,8482204938</Ratio_PPTO_VUELO_PAX_Qlik> <Ratio_PPTO_VUELO_REVENUE_Qlik>24,449326</Ratio_PPTO_VUELO_REVENUE_Qlik> <Ratio_PPTO_VUELO_CAPACITY_Qlik>22,2429057492</Ratio_PPTO_VUELO_CAPACITY_Qlik></Result>

      so it doesn't seem like a coding problem.

      I know why the error occurs, but I have no way of preventing it from occurring because it is random, sometimes it happens and sometimes it does not.

      I am working with version, build of Pentaho Data Integration.

      Even if you have the latest version of Pentaho installed, the error was also reproduced in version 7.0, 8.0 and 8.1

      I need help to know if there is something wrongly configured or it is an internal Pentaho error because the jobs I use have to run well 100% of the time.

      Thanks in advance.


        1. Error xml.png
          Error xml.png
          59 kB
        2. solution.png
          291 kB



            project admin Triage
            Jgarciamo Javier García Moreno
            0 Vote for this issue
            2 Start watching this issue