Release Notes - Pentaho Data Integration - Kettle - Version 4.2.0 GA (4.0.0 GA Suite Release) - HTML format

Bug

  • [PDI-757] - TextFileInput with Ignore Errors and Skip Error Lines enabled causes KettleException and ends transformation
  • [PDI-847] - MergeJoin step causes missing key error if changes are made to transformation
  • [PDI-1114] - ping job entry doesn't work on Linux
  • [PDI-1849] - Dimension lookup/update.0 - java.lang.RuntimeException: Error serializing row to byte array
  • [PDI-1876] - Excel Output with Append option is not working
  • [PDI-1886] - Pan deadlock when using 2 different DB connections
  • [PDI-2131] - Step metrics sort order returns to default
  • [PDI-2224] - extra debug messages showing: vfs_cache (value: null)
  • [PDI-2249] - Send mail step - Port is set as String and not integer
  • [PDI-2671] - Text File Output step is slower with lazy conversions enabled
  • [PDI-2740] - Define Error handling not available for "Combination lookup/update" step
  • [PDI-2883] - Formating problem with test button for steps Modified Java Script Value, User Defined Java Class, Script
  • [PDI-3164] - Right click to share an existing database connection in kettle does not work.
  • [PDI-3233] - Filter row step does not allow deletion of conditions after adding them.
  • [PDI-3362] - Deleted folders in Trash have no deleted date value.
  • [PDI-3803] - Cannot re-open job/transformation from file based repository
  • [PDI-3807] - User Defined Java Expression throws exception if a transformation uses "Lazy conversion"
  • [PDI-3909] - Password obfuscation stored in Kettle.property file does not work in Get/Put SFTP/FTP
  • [PDI-4047] - Specifying transformations and jobs by reference should be disabled when the Kettle file repository is used
  • [PDI-4049] - Using Mondrian Input Step with a JNDI defined connection
  • [PDI-4110] - Japanese characters are not displayed on the carte server
  • [PDI-4223] - All connections, slave servers etc. are disappearing from the tree when a filter (to search for a step) is used
  • [PDI-4281] - Spoon.bat error: GOTO not expected
  • [PDI-4296] - Table Output Step Failing Silently When Error Handling is Enabled
  • [PDI-4495] - Partial exports from the repository are no longer supported
  • [PDI-4543] - Transformation "Fields to log" are not read correctly from the repository
  • [PDI-4636] - Trans Log does not update LINES_READ, etc when transformation completes
  • [PDI-4657] - Cannot save transformation from one Enterprise Repository to another
  • [PDI-4666] - kettle-hadoop and kettle-hadoop-ee should be renamed
  • [PDI-4677] - Forward Button Tooltip on Welcome Tab displays incorrect value: SpoonBrowser.Dialog.Forward
  • [PDI-4827] - Database logging configuration still requires to have schema and table name in one field.
  • [PDI-4848] - "Step performance measurement interval" validated even when "Enable step performance monitoring" not enabled.
  • [PDI-4854] - set-pentaho-env.[bat|sh] should check JAVA_HOME before searching for other Java installations
  • [PDI-4857] - PDI crashed whiile running job with a "java.util.ConcurrentModificationException" error
  • [PDI-4868] - mapping information in a Mappin (sub transformation) step do not get saved
  • [PDI-4870] - Cannot run a transformation that contained 2 levels of subtransformation inside enterpise repository
  • [PDI-4877] - MySQL bulk loader uses bad syntax for ignore option
  • [PDI-4902] - Simple Evaluation step is not working when testing if a variable is set
  • [PDI-4922] - java.lang.NumberFormatException: For input string: "4.1.0"
  • [PDI-4923] - Infobright bulk loader escaping of backslashes
  • [PDI-4931] - Salesforce steps - connection test dialog
  • [PDI-4937] - SalesForce Input plugin trouble with updated and Deleted records
  • [PDI-4942] - Clustering sub-transformation step does not work
  • [PDI-4944] - Error Message at mapping component
  • [PDI-4949] - Database class API problem with batch processing
  • [PDI-4967] - PurRepository getAllFilesOfType makes duplicate requests if more than 1 filter is passed in
  • [PDI-4972] - RSS input is not working when executed through kitchen
  • [PDI-4977] - Update PaloCore subfloor files
  • [PDI-4988] - Shared partitions schemas, slave server and kettle cluster schemas are not being saved to the Enterprise Repository when created from a transformation
  • [PDI-5000] - After restoring a deleted transformation it shows up twice in the folder which it used to live in
  • [PDI-5007] - SQL Job Entry: Unexpected end of statement when sql includes comments start with '--'
  • [PDI-5008] - Missing english tooltip for Check DB connections job entry
  • [PDI-5012] - bug PDI-1473 affects also version 4.0.1 and 4.1.0
  • [PDI-5029] - Get table names outputs wrong names
  • [PDI-5038] - Logging output and reject steps
  • [PDI-5053] - PDI assembly should pull all resoureces from Ivy and none from a direct directory reference
  • [PDI-5063] - NullPointer Exception during performing the debugging-mode
  • [PDI-5094] - LucidDB Streaming Loader datatypes are limited to VARCHAR, INT, DATE and BOOLEAN
  • [PDI-5097] - Hadoop TJE initializes the transformation for every row of input
  • [PDI-5142] - Mondrian input step doesn't support jndi
  • [PDI-5144] - PDI Client contains multiple hadoop-core jars
  • [PDI-5155] - Role selection in security dialogue is difficult if not impossible when LDAP contains many users.
  • [PDI-5157] - subtransformations that use olapinput don't work (moved from PRD-3164)
  • [PDI-5158] - Provide more detail for running remotely
  • [PDI-5159] - Export text file in DOS format doesn't work : creates a file in unix format
  • [PDI-5166] - Unique rows step - Dialog not showing count rows field
  • [PDI-5169] - Strange column rename (result --> "result"_KTL) of a newly created job entry log table from
  • [PDI-5172] - Typo in Spoon Welcome page.
  • [PDI-5190] - Add Checksum step gives incorrect results (MD5, CRC32, ADLER32, SHA-1 are affected)
  • [PDI-5195] - PDI input with a subtransformation that uses parameters doesn't work
  • [PDI-5196] - Answering No to "You are about to change the context. All changes will be lost. Do you want to continue?" causes UI and Model to become out of sync
  • [PDI-5198] - Ping Host job entry fails when run from Kitchen
  • [PDI-5200] - Connecting to the embedded H2 database fails when PDI is installed with the installer
  • [PDI-5227] - Excel Output - split rows doesn't work
  • [PDI-5253] - Proxy Server setting for Slave server is missing
  • [PDI-5255] - Spoon required write permissions on .installedLicenses.xml otherwise it fails to start
  • [PDI-5258] - Abort Step - Cancel don't work properly
  • [PDI-5264] - Thread.dumpStack() calls in SchedulerDialogController.
  • [PDI-5265] - Occasional exception when rendering Job Graph
  • [PDI-5273] - Improve error message when attempting to modify the DI Repository administrative role
  • [PDI-5276] - KETTLE_EMPTY_STRING_DIFFERS_FROM_NULL does not work
  • [PDI-5277] - Postgres Bulk Loader doesn't escape quotes in a string
  • [PDI-5280] - Command line import messes up the job to transformation path
  • [PDI-5284] - Passing parameter to a remotely executed job doesn't work
  • [PDI-5294] - An imported job that refers to an imported tranformation is not able to reference the transformation
  • [PDI-5297] - Concurrent use of DB repository explorer through Spoon does not show updates from other clients
  • [PDI-5314] - XBase.java don't support Integer (datatype 73)
  • [PDI-5349] - Date filter in step 'Email messages input' gives 'Unparseable Date' error.
  • [PDI-5350] - Repository login failed. Invalid userid or password - SOAPFaultException
  • [PDI-5355] - PDI repos: 4.1.2 After deleting joe folder and assiging joe to new folder null pointer exception occurs upon explore of repos -
  • [PDI-5361] - Job log settings error
  • [PDI-5362] - show output fields on olap input step returns NPE
  • [PDI-5367] - regexeval: capture list doesn't resize properly [PATCH]
  • [PDI-5368] - regexeval: optional capture results in null error [PATCH]
  • [PDI-5369] - regexeval: wrong fields are populated by capture groups
  • [PDI-5370] - regexeval: regexhelper doesn't respect regex compilation options [PATCH]
  • [PDI-5372] - Transformation Note - Ariel is default font, should be Arial instead
  • [PDI-5373] - Dimension lookup / update step UI flaw: the field drop down list is no longer loaded in the Dimension field (when table is changed nor at open dialog)
  • [PDI-5377] - Job log setting - saving to database repository
  • [PDI-5389] - regexeval: regexhelper doesn't respect regex compilation options [PATCH]
  • [PDI-5394] - Hadoop Transformation Jobexecutor does not take variables from job level
  • [PDI-5403] - import repository issues
  • [PDI-5404] - folder deletion in the repository explorer
  • [PDI-5420] - Cannot open Jobs or Xforms from Repository Explorer when using the file based repository (null pointer exception)
  • [PDI-5423] - Character doesn`t display properly at pentaho data integration server with multi byte basis locale.
  • [PDI-5425] - As an ETL developer I need a CTRL-W and CTRL-F4 keyboard shortcut to close a transformation or job
  • [PDI-5429] - Add Checksum step is not generating the same value for the identical data
  • [PDI-5430] - Documentation Incorrect for OSX (README_OSX.txt)
  • [PDI-5435] - Spoon job logging to an table in Oracle fails after upgrade from 4.1 to 4.1.2
  • [PDI-5439] - The README_OSX.txt file is providing misinformation about the availability of a .dmg file
  • [PDI-5441] - Spoon Runs out of memory monitoring transform on DIS
  • [PDI-5448] - Number format leads into sql alter table loop
  • [PDI-5461] - Received date filter of Get Mails POP3/IMAP Job doesn't seem to work
  • [PDI-5462] - Debugging Get Mails POP3/IMAP job throws java.lang.NullPointerException
  • [PDI-5479] - Files input Steps - exclude files option not saved in repository
  • [PDI-5485] - NullPointerException in step.BaseStep.getRowFrom
  • [PDI-5487] - Salesforce Input Limit Off By One
  • [PDI-5492] - Use of the PDI Formula step within a Hadoop TJE transformation
  • [PDI-5495] - Clustering: remote input data is not being passed correctly directly into a clustered Mapping
  • [PDI-5496] - StartRow and StartColumn of an ExcelInput-Sheet are lost when the transformation is exported to the repository
  • [PDI-5507] - SFTP zlib compression class not found error
  • [PDI-5510] - Import of exported PDI Enterprise Repository XML erases all "by reference" references in sub job/transactions
  • [PDI-5514] - Partitioning on non-Integer columns in PDI sometimes causes an out-of-bounds exception
  • [PDI-5517] - Excel Input - Required file set to N and no file available - input fails
  • [PDI-5519] - Nullpointer Exception opening a Transformation (database repository)
  • [PDI-5520] - Dimension Lookup/Update: the step dialog is too big for smaller resolutions.
  • [PDI-5522] - LucidDB Streaming bulk loader step doesn't load all metadata from the repository
  • [PDI-5529] - XML Output truncated if zipped
  • [PDI-5530] - Problem loading additional classes for a Hadoop Job (using the Hadoop Job Executor)
  • [PDI-5532] - "Memory Group By" step aggregates wrongly
  • [PDI-5534] - Javascript step bug (possibly due to data grid step). Won't work unless a comment with "email" in the comment somewhere in the script is added.
  • [PDI-5536] - Step 'Email messages input' does not handle reply to well when not available (null pointer exception)
  • [PDI-5541] - Get Table Names cannot only get views
  • [PDI-5542] - Show that StyledTextComp use variables - Add red diamond image
  • [PDI-5547] - Mondrian Input and JNDI Data Source, cant see the database
  • [PDI-5560] - exec SQL script - commentary lines '--' cause error
  • [PDI-5590] - UI problem New Step on right click doesn't work
  • [PDI-5599] - Admin User able to change other Admin's home folder
  • [PDI-5605] - PRD-Datasource tests indicate concurrency error when shutting down job
  • [PDI-5607] - The experimental SetValueConstant step looks up the wrong ValueMetaInterface in the updateField(Object[] r) method.
  • [PDI-5608] - ScriptValuesAddedFunctions.java > loadFileContent > case 2 does not read the encoding variable from the javascript function
  • [PDI-5609] - Synchronize after merge don't commit last batch on postgres
  • [PDI-5614] - Oracle Bulk Loader resetting variables to 0 on OK
  • [PDI-6022] - Report Design Wizard duplicates selected columns in Query Results box.
  • [PDI-6027] - Cell Drillthrough in PDI Visualize not working
  • [PDI-6031] - ClassLoader issues with AgileBI and Generic Connections
  • [PDI-6032] - Spoon Visulize - The analyser's features in spoon are not synchronised with the analyser's feature in the user console
  • [PDI-6034] - NPE when opening Visualize->Analyzer
  • [PDI-6037] - Cannot publish model, which uses JNDI connection
  • [PDI-6059] - Spring Beans error saving an Analyzer report
  • [PDI-6063] - "Invalid object name" error from SQL Server when choosing table from non-default schema for model in Agile BI.
  • [PDI-6117] - Spoons Execution Results - Step Metrics table refreshes incorrectly.
  • [PDI-6118] - Initial location of pasted step in Spoon is incorrect if the canvas is scrollable and not at the top.
  • [PDI-6124] - A step not run on a cluster feeding a step that is run on a cluster is being run on the cluster.
  • [PDI-6133] - JoinRows step causes a memory leak
  • [PDI-6142] - Talend job executor fails to execute the same job twice on the same JVM
  • [PDI-6143] - Replace in String Step - Whole Word not working
  • [PDI-6148] - Spoon has UI problems on Ubuntu 11.04 (Natty)
  • [PDI-6173] - Usage of variable KETTLE_LOG_SIZE_LIMIT is not implemented
  • [PDI-6185] - Variables in Headers tab of HTTP job entry are not substituted
  • [PDI-6190] - SingleThreadedTransExecutor sortSteps() method only sorts down which can lead to steps not passing data.
  • [PDI-6191] - Greenplum bulkloader step is not generating encoding properly on the back end
  • [PDI-6195] - CSV Input step looses changes to the step field name in special circumstances
  • [PDI-6199] - SFTP login fails with pubkey-only authentification
  • [PDI-6207] - TransMeta's InputStream constructor won't load the transformation correctly
  • [PDI-6217] - Remove the magic from the PDI_CE_Assemble release build
  • [PDI-6218] - MetaInjection step fails to properly save information in the repository + race condition
  • [PDI-6222] - The Single Threader steps batch size is always passing 1 extra row
  • [PDI-6223] - While executed using the "Single Threader", the "Memory Group By" step needs to reset counters for each batch.
  • [PDI-6230] - PDI BigNumber is not translating well to Postgres Numeric
  • [PDI-6233] - Olapinput is useless and swallows all signs of whats actually gone wrong
  • [PDI-6237] - CLONE -Variables in Headers tab of HTTP job entry are not substituted
  • [PDI-6238] - CLONE -Variables in Headers tab of HTTP job entry are not substituted
  • [PDI-6239] - CLONE -Evaluate File Metrics Job step evaluates Wildcard on entire vfs rather than on file name
  • [PDI-6242] - CLONE -"Add files to result files name" option in job step "Copy Files" causes an error
  • [PDI-6245] - When using undo on a step detach operation in a transformation you can have open-ended hops
  • [PDI-6247] - getXMLData step is performing differently in Spoon when connected via Windows and Linux
  • [PDI-6252] - Cannot open dialog of ETL Metadata Injection
  • [PDI-6253] - 4.2.0 M2 not starting on Mac OS X
  • [PDI-6255] - Using an IN list causes performance problem with the Filter Rows step
  • [PDI-6259] - Pentaho Reporting Output: Excel 2007 option missing
  • [PDI-6260] - Clustering & Slave servers aggregator
  • [PDI-6261] - The logging registry throws out recent entries while implementing the LRU list
  • [PDI-6263] - The CSV File Input doesn't (always) correctly close the file being read.
  • [PDI-6267] - PDI 4.1 fails to open transformations stored in a folder (subdirectory). It issues a null-pointer exception
  • [PDI-6269] - libext/reporting contains jars that are dependent on Kettle - pentaho-metadata, extensions-pmd, extensions-kettle
  • [PDI-6273] - Join Rows step blocks rows when one stream is empty and number of rows coming from the other step is greater than the row set size
  • [PDI-6276] - CSVInput Step: Filename not being added to last row when last item is null and no end-of-line char
  • [PDI-6287] - Pentaho 4.0.1. After ZIP in a job, Advanced / delete files does not delete source files. Worked in PDI 3.2
  • [PDI-6288] - Agile BI Analyzer Vis not functional in RC build
  • [PDI-6289] - license dialog shows up behind the splash screen and we can't get rid of it
  • [PDI-6290] - Spoon is throwing a ClassNotFoundException at startup - Scheduling is not functional
  • [PDI-6291] - Unable to save 2nd 'single thread' step in the database repository
  • [PDI-6295] - Schedule dialog is not being displayed.
  • [PDI-6297] - Please select a Datasource button doesn't appear until Model root node is clicked.
  • [PDI-6299] - Analysis and Reporting tabs disappear when datasource is selected.
  • [PDI-6303] - stop-pec.sh from the tar.gz achive is throwing a NoClassDefFoundError
  • [PDI-6304] - Multiple Transformation (Single Job) Logging Causing MS SQL Server Deadlock
  • [PDI-6305] - After using the PDI installer the following scripts can't find the java executable: start-pec.sh, stop-pec.sh, carte.sh, kitchen.sh, pan.sh and import.sh.
  • [PDI-6306] - Dimension node not populated when opening a saved Modeler (xmi) file.
  • [PDI-6308] - Agile BI nothing works in spoon (model or visualize options) on a ktr
  • [PDI-6309] - Refresh button in Data panel has no tooltip. Not sure what it is supposed to do.
  • [PDI-6312] - "Hiearchy require at least one Level" typo.
  • [PDI-6314] - Enterprise Console does not use schema when querying log tables
  • [PDI-6315] - Analyzer report publish from AgileBI doesn't work. Says it's successful but no new file in solution repository.
  • [PDI-6317] - hbase dependencies must be included in Kettle libext for the new hbase input and output steps to function correctly
  • [PDI-6318] - Two measures based on same field with different aggregations doesn't work.
  • [PDI-6319] - No columns when doing Visualize > Report Wizard off of table output step.
  • [PDI-6333] - Exception is thrown when an attempt to get to the job settings is made through the context menu
  • [PDI-6337] - Agile BI does not display the dimensions of a saved model.
  • [PDI-6344] - Exception adding filename to result
  • [PDI-6346] - When a transformation type is dragged onto the designer canvas the state of the palette tree becomes collapsed
  • [PDI-6352] - The Shell job entry eats the original exception you get when you try to execute a shell that is dynamically created
  • [PDI-6354] - Casting problems with select values step in 4.2.0-RC1 - works fine in 4.1.1
  • [PDI-6357] - MAC installer is throwing a ClassNotFoundException when installing licenses
  • [PDI-6364] - Fuzzy Unit test in TableOutputTest fails on occasion
  • [PDI-6365] - Error saving transformation: Special characters are not allowed in the filename, Please use ASCII characters only.
  • [PDI-6368] - Wrong Hadoop-core jar in DIS
  • [PDI-6372] - NPE finishing "Copy table" wizard
  • [PDI-6379] - Hive history files are being generated by transformations that are not using Hive.
  • [PDI-6382] - CarteSingleton.installPurgeTimer does not release resources from jvm
  • [PDI-6383] - MicrosoftExcelOutput step does not correctly populate wCreateParentFolder control.
  • [PDI-6387] - Auto Documentation step - format PDF - gives Insufficient data for image when opened with adobe reader
  • [PDI-6388] - MySQL bulk loader is throwing error in PDI-4.1.2.1
  • [PDI-6393] - GetXMLData doesn't work when option isIgnoreEmptyFile is checked
  • [PDI-6404] - When executing the same transformation multiple times in parallel in a job you can get duplicate batch IDs in the trans log table
  • [PDI-6406] - Check if a folder is empty gives "Could not create Folder" on linux
  • [PDI-6407] - File based repository: failure to open a job
  • [PDI-6412] - Microsoft Excel Output: Loss of functionality
  • [PDI-6414] - Remove dependency on Jetty for the simple WriterOutputStream
  • [PDI-6423] - "Previous results to args?" option does not work on Truncate tables job entry
  • [PDI-6426] - Autodoc step doesn't save the file type field correctly to a repository
  • [PDI-6427] - Run ssh command save step info in r_jobentry attributes instead of r_step_attributes
  • [PDI-6437] - when transformation/job logging enabled, the database connection doesn't close
  • [PDI-6440] - Kettle logs not printing chained exceptions
  • [PDI-6448] - The "Pad Char" filed of the step "String Operations" is not saved in the repository
  • [PDI-6449] - "Output steps metrics" step is not outputing Rejected entries statistic
  • [PDI-6462] - In a model dimension, delete a level from a hierarchy the the tool tip and properties panel error message reads "Hierarchy must have at least one level, Need at least one level"
  • [PDI-6485] - First DI Server's scheduler is used when connecting to another DI Server in Spoon
  • [PDI-6486] - Cannot schedule newly created Transformation
  • [PDI-6487] - Remove Vertica JDBC driver from the PDI distribution
  • [PDI-6508] - Unzip job entry - Select folder does not work
  • [PDI-6524] - Pig Script Executor entry has the wrong label for Job Entry Name field
  • [PDI-6528] - The google-analytics-input-step has two issues: Localized messages not being resolved and preview/transformations not working
  • [PDI-6530] - Sample rows transformation step does not notify source step that it is complete
  • [PDI-6533] - Reading SFTP PORT fom kettle properties cause error
  • [PDI-6535] - Various localized kettle_document_map.html files are not in sync.
  • [PDI-6546] - FileNotFoundExceptions in nightly PDI EE Spoon.
  • [PDI-6547] - Email Input Step do not load output Fields correctly from repository
  • [PDI-6553] - Analyzer charting not working in Model-View plugin to Spoon (AgileBI)
  • [PDI-6556] - Unzip job entry - Addtime option not loaded correctly from repository
  • [PDI-6557] - Reading SFTP PORT fom kettle properties cause error in "Put a file with SFTP" step
  • [PDI-6566] - Agile BI: Publish fails.
  • [PDI-6569] - Log output has unreplaced placeholders.
  • [PDI-6578] - The getting_started_with_pdi.pdf file is not up to date with the GA release
  • [PDI-6587] - PDI installer is reporting an error - "No origin file found that matched..."
  • [PDI-6596] - Transformations using Palo can not be opened
  • [PDI-6607] - Executing a transformation with performance monitoring enabled crashes Spoon
  • [PDI-6617] - "Salesforce Input" step java.lang.NoClassDefFoundError when excecuted on DI Server
  • [PDI-6620] - Exception when importing repository export from 4.1.1 era
  • [PDI-6623] - welcome page references the admin guide instead of the user guide pdf in linux
  • [PDI-6625] - samples/jobs/hadoop/ does not include a jar file that works with the Hadoop Job Executor in simple mode
  • [PDI-6639] - Welcome screen needs the "Getting started guide" link updated to the correct file "getting_started_with_pdi.pdf"
  • [PDI-6695] - Kettle + mysql connector 5.1.15: AS in select not taken in account
  • [PDI-6696] - SHA-1 Checksum not working
  • [PDI-6731] - repository explorer fails on file-based repo (infinite loop) - Widget is disposed
  • [PDI-7076] - Pan.bat: Missing paths in JAVA_EXT_DIRS
  • [PDI-7290] - Single Threaded Reducer Execution Causes Values to Be Added to Keys
  • [PDI-9232] - Mapping (sub-transformation) fails when corresponding Mapping input has "Include unspecified fields" ticked

Improvement

  • [PDI-899] - As an ETL Designer, I want the ability to add more options to the JVM when executing pan or kitchen
  • [PDI-1318] - "Access input" should be called "MS Access input"
  • [PDI-1741] - Text file output step: allow "No new-line terminator"
  • [PDI-2357] - Batch processing in the Update step
  • [PDI-2666] - CLONE -New Calculator Calculation Type to support standard rounding
  • [PDI-2830] - Allow for flexible optimization of JVM options for use with PDI (e.g. -Xmx, -server options, Garbage Collector, Excessive GC Time)
  • [PDI-3913] - Connector to Lucid DB doesn't support http
  • [PDI-4045] - Repository explorer: Export should be selectable at directory level
  • [PDI-4343] - SalesForce Input: Provide Deletion Date/Time when Retrieving Deleted Records
  • [PDI-4425] - MySQL Bulk Load transformation step should list the columns to be loaded in the LOAD DATA INFILE step.
  • [PDI-4563] - Excel 2007 and OpenOffice spreadsheet support aggregator
  • [PDI-4851] - Locking of database table is very inconvienient
  • [PDI-4856] - Increase the length of the value of the variable in the Combination Lookup step
  • [PDI-4893] - Synchronize the version of JDBC used to access Infobright with that used by report engine
  • [PDI-4963] - Add Excel Writer Plugin from type-exit.org
  • [PDI-4968] - RepositoryDirectoryUI and UIRepositoryDirectory are not making use of a unified job and transformation fetch
  • [PDI-5001] - Excel Writer Step should support recalculating all formulas
  • [PDI-5011] - The excel writer plugin should offer the option to not change styles of existing cells from the template
  • [PDI-5055] - Update DI Server and downstream Kettle dependencies with new jars for google analytics input step
  • [PDI-5151] - Parameters for the XSLT transformation step
  • [PDI-5163] - SFTP PUT job entry - Allow to move local file juste after upload
  • [PDI-5168] - Excel Output - Allow to create automatically parent folder if needed
  • [PDI-5217] - As a CE User, I would like a link on the Welcome Page that tells me how to get the free Agile BI plugin
  • [PDI-5219] - As a CE User, I want an official release of the CE Hadoop Plugin
  • [PDI-5224] - Prioritize step - Extend Append step to many streams
  • [PDI-5228] - Excel Output - ability to set font/color/ for rows and header
  • [PDI-5231] - Upgrade Saxon to 9
  • [PDI-5307] - Upgrade third party dependencies for Kettle in di-server, platform, metadata and reporting
  • [PDI-5353] - PDI can't be a REST client as existing HTTP steps can't do PUT and DELETE (let alone custom methods)
  • [PDI-5364] - The Dialog of the Excel Writer Step should have scrollbars, since it is so big
  • [PDI-5383] - Add Metadata-Injector to Row Denormalizer
  • [PDI-5395] - regexeval: replacement option [PATCH]
  • [PDI-5409] - Improve error reporting for database connection failure
  • [PDI-5426] - Update Jersey jars for downstream PDI dependencies (DI Server, etc) to match those checked into Kettle trunk (4.2.0)
  • [PDI-5427] - Screen size for Oracle Bulk Loader config too large for lower resolution screen
  • [PDI-5453] - Unzip step causing unclear error message
  • [PDI-5457] - Please upgrade the Salesforce Plugins to support version 21 of the Salesforce API
  • [PDI-5493] - Text output step should at least have option to create parent folder if it does not exist.
  • [PDI-5497] - Update all jars in downstream dependencies of PDI to match current versions
  • [PDI-5498] - As an ETL designer I want to limit the number of started threads used in a transformation to minimize overhead
  • [PDI-5502] - Further reduce memory usage of PDI by setting better defaults on the logging options.
  • [PDI-5521] - As an ETL manager I want to have a step that gives me information about the content of a repository
  • [PDI-5526] - The trans step WriteToLog should have a message text area like the job version.
  • [PDI-5527] - Pentaho Output Reporting - Upgrade to PRD 3.8 and integrate in PDI 4.2
  • [PDI-5545] - As an ETL documenter I want to have a step that generates documentation when given filenames or repository paths as input
  • [PDI-5546] - As an ETL Designer, I want the Excel Output step to create Excel 2007 document formats (xlsx)
  • [PDI-5549] - Upgrade the PgSQL JDBC Driver
  • [PDI-5550] - Test, fix and certify support for Cloudera Distribution for Hadoop 3 GA
  • [PDI-5553] - CLONE -Test, fix and certify support for Cloudera Distribution for Hadoop 3 beta 4 (Hive 0.7rc0)
  • [PDI-5606] - The Calculator step should have the option to add hours and minutes to a date.
  • [PDI-5612] - Transition pentaho-database (common dialog) to kettle-dbdialog as part of the core kettle build process
  • [PDI-5613] - Excel output step - Add option to use a temporary file is used during the generation of the workbook
  • [PDI-5985] - As a developer, I want the data source type to be pluggable in the thin client Agile Wizard
  • [PDI-6122] - Facility to configure time out for the Join Row step
  • [PDI-6137] - Please add ability to authenticate to remote server in MongoDB Input
  • [PDI-6139] - The Replace in string step would benefit from an additional Replace by field column.
  • [PDI-6152] - Row-level locking in data transfers between steps inadequate
  • [PDI-6157] - Allow result row variables to be set in the Job and Transformation job entries
  • [PDI-6159] - As an ETL engine embedder I want to be able to specify alternative kettle-steps.xml and kettle-jobentries.xml files
  • [PDI-6160] - I want to be able to run a sequence of steps in a single thread
  • [PDI-6175] - Improve the new Single Threader step with time window & parameters
  • [PDI-6180] - I want the "Set Variables" job entry to have support for reading a properties file
  • [PDI-6206] - SFTP - upgrade jsch lib to 0.1.44
  • [PDI-6209] - As Pentaho, I want the new Greenplum bulkloader packaged as a plugin that works with 4.1.x
  • [PDI-6211] - Allow Oracle to use batch processing in combination with error handling in the Table Output step by giving a clear warning.
  • [PDI-6216] - Fix the Mapping step UI to make the simple cases less complex to configure
  • [PDI-6220] - I want to be able to read output of a source step in an "ETL metadata injection" step.
  • [PDI-6254] - Introduce core plugins to Kettle - gpload, palo
  • [PDI-6268] - As a PDI CE user, I want a quick and obvious link on the welcome page to download and install Agile BI
  • [PDI-6285] - Upgrade Jackess lib to 1.2.4
  • [PDI-6293] - Update PDI Agile BI Visualization (Analyzer) to the latest version
  • [PDI-6338] - Move HL7 Steps into a core plugin
  • [PDI-6409] - Add LIBOVERLAY_SCROLLBAR=0 to spoon.sh for folks on Ubuntu 11.04
  • [PDI-6410] - As Pentaho, I want PDI 4.2 GA to be updated with Pentaho Reporting 3.8.1 GA
  • [PDI-6411] - As Pentaho, I want the 'simple web service' feature of PDI 4.2 to work with the EE DI Server
  • [PDI-6476] - "Get file names" - Exception can't be handled
  • [PDI-6501] - Unzip step does not retain the original file timestamp
  • [PDI-6537] - I'd like to be able to Inject the Metdata in the Fixed File input file step
  • [PDI-6621] - Update PDI distributions to only include PDI related documents
  • [PDI-6870] - Display a message instead of throwing error when trying to open Transformation/Jobs from "Repository Dialogue" dialogue box

New Feature

  • [PDI-26] - As an ETL Designer/Administrator, I want a command line tool to import/export objects from repository
  • [PDI-1690] - Importing many ktr and kjb files into a repository.
  • [PDI-1918] - Add PGP encryption job entries to encrypt or decrypt files
  • [PDI-2792] - Handle large unbalanced XML files
  • [PDI-2905] - Database: Make the use of getColumnLabel and getColumnName configurable due to JDBC driver incompatibilities and specification changes (impact on fieldname alias)
  • [PDI-2931] - LDAP Writer Step
  • [PDI-4242] - As a Hadoop users, I want the ability to use individual steps as Map/Reduce tasks.
  • [PDI-4686] - Open Source Google Analytics Plugin
  • [PDI-5136] - As an ETL Developer I want to see the progress of the transformation as a whole, the individual step metrics, and potential bottlenecks on the canvas
  • [PDI-5211] - As an ETL Designer, I want a BulkLoad step for Greenplum that wraps the gpload facility
  • [PDI-5313] - XML: Create a new step that is capable of processing very large and complex XML files very fast
  • [PDI-5363] - As an ETL developer I need a bulk loader for Ingres VectorWise
  • [PDI-5376] - getxmldata
  • [PDI-5422] - As an ETL designer I want to define the columns of a spreadsheet at runtime using metadata injection
  • [PDI-5428] - As an ETL developer I want to be able to restart a failed job at the point where it failed last.
  • [PDI-5437] - As an ETL developer I want to read data from MongoDB
  • [PDI-5540] - As an ETL developer, I'd like to be able to retrieve Mail Headers from Mail Input step
  • [PDI-5555] - As an evalulator, I would like a BI Suite and PDI installer for the macintosh operating system
  • [PDI-5600] - As a PDI CE user, I would like a simple download link on the welcome page to download the AgileBI plugin for Spoon
  • [PDI-6068] - As a data access and Agile BI user I want the modeler to have a reporting metadata perspective along with the OLAP perspective.
  • [PDI-6123] - As an ETL developer I want to easily integrate with 3rd party tools over a web service
  • [PDI-6145] - Deliver PDI 4.1.3 Patch release
  • [PDI-6325] - Create a new step that retrieves global sequence numbers from a master server
  • [PDI-6369] - Multiway merge join
  • [PDI-6447] - Add symmetric crypto steps
  • [PDI-6561] - Need an option to load the data parallely in oracle bulk loader when the tables are set to lock
  • [PDI-6598] - As an ETL developer, I would like an ElasticSearch Bulk load step

Task

  • [PDI-5535] - As an ETL developer I want to execute Talend Jobs in Pentaho Data Integration
  • [PDI-5539] - As an ETL developer in need of a report I want the Pentaho Reporting Output step to be included in vanilla PDI
  • [PDI-5559] - Port over the existing HL7 plugins from http://code.google.com/p/kettle-hl7/
  • [PDI-6204] - Fix current issues with respect to Error Handling, Performance and Key returning of the Table Output step
  • [PDI-6279] - Remove hard coded TRUNK-SNAPSHOTs from new publish .xml files
  • [PDI-6349] - Rollback AgileBI plugin to previous version for 4.2.0 RC build
  • [PDI-6413] - As Pentaho, I want the 'Generate unique slave ID' step of PDI 4.2 to work with the EE DI Server
  • [PDI-6415] - As Pentaho, I want the 'list (open and free) socket reservations' servlet of PDI 4.2 to work with the EE DI Server
  • [PDI-6517] - Clean up the PDI steps and job entries pages.
  • [PDI-6574] - check in Pentaho Reporting Output sample

Sub-document

  • [PDI-212] - As an ETL Designer, I want the ability to read from and write to OpenOffice calc ods file format
  • [PDI-1499] - Table output steps do not show consistent usage of batch insert mode
  • [PDI-1564] - Table output sqlite error handling is flawed.
  • [PDI-2043] - When usng a dynamic carte cluster, the master keeps accumulating more and more open sockets until eventually you hit the system ulimit and it fails.
  • [PDI-2150] - Table Output : Error handling - when no batch updates: sending all rows, including the error rows, to the next step
  • [PDI-2356] - Inappropriate behaviour of Table Output during unsuccessful commit of batch package - Oracle
  • [PDI-2384] - Committed Code: Multi enable/disable of hops
  • [PDI-2495] - Wrong error handling output from step Table output
  • [PDI-3638] - Evaluate upgrading jxl (java excel) library to latest stable version
  • [PDI-3663] - JDBC SQLite driver is deprecated
  • [PDI-4306] - Get variables at the beginning of a clustered part within a transformation discards all incoming rows
  • [PDI-4309] - Partitioning method "Remainder of division" (Mod partitioner) is not partitioning correctly
  • [PDI-4532] - Table Output and Error Handling with MySQL database: Batch insert mode disabled because of database limitations.
  • [PDI-4664] - Table output error handling changed between PDI 3.2.4 and 4.0.1
  • [PDI-4821] - Transformation parameter value (set in job, on the parameter tab) is not available within the transformation
  • [PDI-4933] - Table Output step doesn't allow batch update after connecting an error output step and then deleting it
  • [PDI-5285] - Shouldn't functionality referenced in PDI-2531 also apply to jobs when executing a job on a remote server from a Job entry
  • [PDI-6116] - Table Output and Error Handling with Oracle database: Batch insert mode disabled because of database limitations.
  • [PDI-6176] - Job/Trans test to ensure correct nested param passing
  • [PDI-6215] - Master slave server reporting that a port is already in use
  • [PDI-6219] - Log file from cluster setup get mixed together
  • [PDI-6221] - Runaway threads in PDI server
  • [PDI-6262] - Make sure to properly shut down the slave monitoring timer when stopping a slave server (Unit test)
  • [PDI-6264] - Allow for multiple parallel runs of the same transformation across a cluster
  • [PDI-6324] - Generate Rows to a clustered step ceases to work
  • [PDI-6330] - Carte Cluster: Non-clustered steps between clustered steps causes hang

Sub-task

  • [PDI-183] - As an ETL Designer, I want the ability to read from a protected workbook via Excel input step
  • [PDI-430] - Sort step has problems with lazy data type resolution where the data comes from CSV files
  • [PDI-4302] - Clustering keeps ports open when errors occur in the initialization phase of the Table output step (eventually other steps)
  • [PDI-5128] - Performance for Error Handling Steps is Very Poor
  • [PDI-5384] - named parameters of job are lost when I use repository
  • [PDI-5515] - Return auto generated key in table output fails on SQL Server 2005
  • [PDI-5567] - User testing
  • [PDI-5574] - GUI Code
  • [PDI-5575] - Back End Code
  • [PDI-5576] - Update Doc & QA Notes: Knowledge Transfer
  • [PDI-5577] - User Testing
  • [PDI-5579] - Unit Tests
  • [PDI-5586] - UX mocks and review GP Load
  • [PDI-6136] - Table Output fails when connected to MS SQL Server (Native) and using Error Handling on the step
  • [PDI-6402] - "We can not find destination field...in input stream!" error in Mail transformation step.

Edit/Copy Release Notes

The text area below allows the project release notes to be edited and copied to another document.