  1. Pentaho Data Integration - Kettle
  2. PDI-17891

ArrayIndexOutOfBoundsException on executing csv import jobs

    Details

    • Type: Bug
    • Status: Closed
    • Severity: High
    • Resolution: Fixed
    • Affects Version/s: 8.1.0 GA
    • Fix Version/s: 9.0.0 GA
    • Component/s: Job
    • Labels:
      None
    • Environment:
      Windows 10 (64-bit)
      Oracle 10.2.0.5.0
      JDK 1.8
      PDI-CE-8.1.0.0.365
    • Story Points:
      0
    • PDI Sub-component:
    • Notice:
      When an issue is open, the "Fix Version/s" field conveys a target, not necessarily a commitment. When an issue is closed, the "Fix Version/s" field conveys the version that the issue was fixed in.
    • Sprint Team:
      Tatooine (Maint)
    • Operating System/s:
      Windows 10
    • Steps to Reproduce:
      1. Create a CSV file that includes a timestamp field (each file has over 1,000,000 records).
      2. Create two CSV import jobs.
      3. Execute both import jobs at the same time.

      CSV_sample

      1,1,"1","AA","6","00","00000000","2",2003/08/19 20:34:25.000000000,"1","1"
      2,1,"1","AA","4","00","00000000","1",2003/08/20 13:32:40.000000000,"1","0"
      3,2,"1","AA","5","00","00000000","1",2003/08/20 13:32:41.000000000,"1","0"
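A file of the size described in step 1 can be produced with a small generator. The sketch below is only an illustration: the class name `CsvSampleGenerator`, its methods, and the choice to repeat the filler columns from the first sample row are assumptions, not part of the report. Only the timestamp layout (`yyyy/MM/dd HH:mm:ss` plus nine fractional digits) is taken from the sample above.

```java
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;

// Hypothetical helper for building a large test file; not part of PDI.
class CsvSampleGenerator {
    // Same layout as the sample rows: date, time, nine fractional digits.
    private static final DateTimeFormatter TS =
            DateTimeFormatter.ofPattern("yyyy/MM/dd HH:mm:ss.SSSSSSSSS");

    // Builds one row; every column except the id and the timestamp is
    // copied from the first sample row (an assumption for illustration).
    public static String row(long id, LocalDateTime ts) {
        return id + ",1,\"1\",\"AA\",\"6\",\"00\",\"00000000\",\"2\","
                + TS.format(ts) + ",\"1\",\"1\"";
    }

    // Writes `rows` lines, advancing the timestamp by one second per row.
    public static void write(Path target, int rows) throws IOException {
        LocalDateTime start = LocalDateTime.of(2003, 8, 19, 20, 34, 25);
        try (BufferedWriter out =
                     Files.newBufferedWriter(target, StandardCharsets.UTF_8)) {
            for (int i = 1; i <= rows; i++) {
                out.write(row(i, start.plusSeconds(i - 1)));
                out.newLine();
            }
        }
    }
}
```

Calling `CsvSampleGenerator.write(Paths.get("sample.csv"), 1_000_001)` would give a file above the one-million-record threshold mentioned in the steps; the file name is arbitrary.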
      

      Description

      When we execute two "import csv" jobs at the same time, we get an ArrayIndexOutOfBoundsException.
      The CSV files include a "timestamp" field.

      The problem does not occur when each job is executed alone.

      java.lang.ArrayIndexOutOfBoundsException: 19
         at sun.util.calendar.BaseCalendar.getCalendarDateFromFixedDate(BaseCalendar.java:453)
         at java.util.GregorianCalendar.computeFields(GregorianCalendar.java:2397)
         at java.util.GregorianCalendar.computeFields(GregorianCalendar.java:2312)
         at java.util.Calendar.complete(Calendar.java:2268)
         at java.util.Calendar.get(Calendar.java:1826)
         at java.text.SimpleDateFormat.subFormat(SimpleDateFormat.java:1119)
         at java.text.SimpleDateFormat.format(SimpleDateFormat.java:966)
         at java.text.SimpleDateFormat.format(SimpleDateFormat.java:936)
         at java.text.DateFormat.format(DateFormat.java:345)
         at org.pentaho.di.core.row.value.timestamp.SimpleTimestampFormat.parse(SimpleTimestampFormat.java:373)
         at java.text.DateFormat.parse(DateFormat.java:364)
         at org.pentaho.di.core.row.value.timestamp.SimpleTimestampFormat.parse(SimpleTimestampFormat.java:449)
         at org.pentaho.di.core.row.value.ValueMetaTimestamp.convertStringToTimestamp(ValueMetaTimestamp.java:255)
         at org.pentaho.di.core.row.value.ValueMetaTimestamp.convertData(ValueMetaTimestamp.java:417)
         at org.pentaho.di.core.row.value.ValueMetaBase.convertBinaryStringToNativeType(ValueMetaBase.java:1586)
         at org.pentaho.di.trans.steps.csvinput.CsvInput.readOneRow(CsvInput.java:717)
         at org.pentaho.di.trans.steps.csvinput.CsvInput.processRow(CsvInput.java:151)
         at org.pentaho.di.trans.step.RunThread.run(RunThread.java:62)
         at java.lang.Thread.run(Thread.java:748)
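
The trace above fails inside `java.text.SimpleDateFormat.format`, and the JDK documentation states that date formats are not synchronized. An exception from the internal `Calendar` is a typical symptom of one formatter instance being used by several threads at once, which would match the observation that the error appears only when two jobs run concurrently. The report does not state the root cause or how the fix in 9.0.0 was implemented, so the following is only a sketch of one common mitigation: giving each thread its own formatter. The class `PerThreadTimestampFormat` and its methods are hypothetical and are not PDI code.

```java
import java.text.SimpleDateFormat;
import java.util.ArrayList;
import java.util.Date;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical illustration; not the change that was made in PDI.
class PerThreadTimestampFormat {
    // One SimpleDateFormat per thread, so no instance is ever shared.
    // SimpleDateFormat only handles milliseconds, hence three fraction digits.
    private static final ThreadLocal<SimpleDateFormat> FORMAT =
            ThreadLocal.withInitial(
                    () -> new SimpleDateFormat("yyyy/MM/dd HH:mm:ss.SSS"));

    public static String format(Date date) {
        return FORMAT.get().format(date);
    }

    // Formats the same date from several threads and counts results that
    // differ from the single-threaded reference value.
    public static int countMismatches(int threads, int iterations) {
        final Date fixed = new Date(0L);
        final String expected = format(fixed);
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<Integer>> results = new ArrayList<>();
            for (int t = 0; t < threads; t++) {
                Callable<Integer> task = () -> {
                    int bad = 0;
                    for (int i = 0; i < iterations; i++) {
                        if (!expected.equals(format(fixed))) {
                            bad++;
                        }
                    }
                    return bad;
                };
                results.add(pool.submit(task));
            }
            int total = 0;
            for (Future<Integer> f : results) {
                total += f.get();
            }
            return total;
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException(e);
        } finally {
            pool.shutdown();
        }
    }
}
```

With per-thread instances, `countMismatches` should report no mismatches and raise no exception. Replacing the `ThreadLocal` with a single shared static `SimpleDateFormat` is the kind of arrangement that can produce corrupted output or exceptions under load. Other options include synchronizing access to the formatter or using the thread-safe `java.time.format.DateTimeFormatter`.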
      

            People

            Assignee:
            vasco.marques Vasco Marques
            Reporter:
            katsu minoru katsumata
            Votes:
            0
            Watchers:
            10
