Pump Up Pentaho

Power Transforms, Protect PII, Prototype Operations


While Pentaho Data Integration (PDI) is a powerful tool for preparing and integrating data, it also has some shortcomings:

  • Slow Transforms

    Native sorts and other transforms may not run fast enough at high data volumes.
  • Limited De-ID Features

    Cannot mask or encrypt data flowing through PDI (Kettle).
  • Limited Test Data

    Cannot prototype ETL jobs without using production data.


PDI workflows can invoke system commands, so data can be handed to external programs for processing without disrupting the workflow. IRI Voracity or its component software can help Pentaho users in the following ways:

  • Test Your Apps

    • Run IRI RowGen to populate tables, files and reports with synthetic test data that mimics production data
    • Generate structurally and referentially correct DB test data for an entire enterprise data warehouse (EDW)
    • Keep production data safe

      Related article: Creating Test Data for Pentaho.
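
To make "referentially correct" concrete, here is a minimal Python sketch of the general idea. It is not RowGen syntax and does not reflect how RowGen works internally. The function name, the table and column names (customers, orders, customer_id), and the value ranges are all hypothetical, chosen only for illustration. Every generated order points at a customer that exists, and no production data is read.

```python
import random


def make_test_data(n_customers, n_orders, seed=0):
    """Generate two synthetic tables where every order references a real customer.

    All names and value ranges here are hypothetical, for illustration only.
    """
    rng = random.Random(seed)  # seeded so repeated runs yield identical data

    # Parent table: primary keys are the integers 1..n_customers.
    customers = [
        {
            "customer_id": i,
            "name": f"Customer {i:04d}",
            "region": rng.choice(["North", "South", "East", "West"]),
        }
        for i in range(1, n_customers + 1)
    ]

    # Child table: each foreign key is drawn only from existing parent keys,
    # which is what keeps the two tables referentially consistent.
    parent_keys = [c["customer_id"] for c in customers]
    orders = [
        {
            "order_id": j,
            "customer_id": rng.choice(parent_keys),
            "amount": round(rng.uniform(5.0, 500.0), 2),
        }
        for j in range(1, n_orders + 1)
    ]
    return customers, orders


if __name__ == "__main__":
    customers, orders = make_test_data(n_customers=5, n_orders=20, seed=1)
    known_ids = {c["customer_id"] for c in customers}
    # Integrity check: no order may reference a missing customer.
    assert all(o["customer_id"] in known_ids for o in orders)
    print(len(customers), "customers and", len(orders), "orders generated")
```

Rows like these could be written to files or loaded into tables to exercise an ETL job before any real data is involved. Making the values mimic production distributions and formats would take considerably more than this sketch shows.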