Voracity Features & Benefits


Data Discovery, Integration, Migration, Governance, Analytics

IRI Voracity® is a full-stack data lifecycle management platform that combines CoSort®, Eclipse™, Hadoop®, and other best-in-class technologies.

You won't believe how much you can do!

Since 1978, IRI CoSort has modernized and accelerated data transformation, conversion, and reporting. CoSort is a proven ETL optimizer, BI data preparation engine, and DB load and query accelerator that can also mask PII, build test data, and produce reports. Today, IRI Voracity harnesses the power of CoSort and Hadoop, and the familiarity and interoperability of Eclipse, plus plug-ins like AnalytiX DS Mapping Manager and StatET for R, to deliver data discovery, integration, migration, governance, and analytics.

The sections below cover the major data and enterprise information management capabilities of Voracity. Unless designated as a third-party option with an asterisk (*), all features are included in the base license.

Job Design

| Job Design Features | Details | Benefits |
| --- | --- | --- |
| New job wizards | auto-creates ETL, migration, masking, test data, and reorg scripts | hand-held, step-by-step job creation |
| Dialog and form editors | point and click job specification from the toolbar or script | easier parm definition and modification |
| Job/mapping diagrams | intuitive GUI and palette for ETL mapping and workflow scripts | fast and easy job design and review |
| Syntax-aware editors | write, modify and validate: IRI DDF, job scripts, SQL, and Java | delight code-centric data and ETL architects |
| Script menus and outlines | drill-down GUI views of, and dialog interactivity with, job scripts | bypass the need to learn the 4GL |
| DDF and SCL metadata | self-documenting, cross-IRI-job-, MIMB- and ADSMM-compatible | easy, interoperable, batchable CLI code |
| EMF and XML metadata | self-documenting, re-entrant EMM infrastructure (see below) | modify jobs in any UI and update all |
| "Gulfstream" API/SDK | documented JARs for IRI flow and metadata to/from applications | easy EAI, easy enter/exit for IRI XML |
| AnalytiX DS Mapping Manager * | spreadsheet-style, code-free source-target mapping definitions | leverage existing skills and repositories |
| AnalytiX DS CatFx and LSC * | auto-migrate ETL tool and SQL metadata to IRI Voracity | leave megavendor ETL tools sooner |

Data Discovery

| Data Discovery Feature | Details | Benefits |
| --- | --- | --- |
| Data Classification | define and manage enterprise-wide data class libraries | apply transformation and protection field rules across multiple sources at once |
| Structured data discovery | preview sequential file/DB layouts, define IRI DDF | automated metadata creation |
| DDF conversion and import | CPY, CSV, LDIF, ODBC and XML to DDF migration in CLI/GUI | automated metadata conversion |
| DB & file profiling - statistics | select/report on column or field values, counts, lengths, duplicates, etc. | automated, custom table analysis |
| DB & file pattern search | finds values in tables or flat files conforming to defined Java RegEx patterns | auto-find and protect numeric PII |
| DB & file string search | find explicit or dictionary-matching strings (names) in tables or flat files | auto-find and protect individuals |
| DB & file fuzzy search | finds near matches to defined probabilities with multiple algorithms | facilitate masking, BI, DQ, and MDM |
| DB integrity check | compare foreign keys with primary key in each defined column | easily maintain referential integrity |
| Cross-DB E-R diagramming | create and customize views of tables and relationships in any DB | visualize layouts of multiple DBs |
| Dark data discovery | find/report on pattern-matching values and forensic info in documents | expose dark data and metadata |
| Dark data structuring | extract and structure found values into flat files, create DDF and EIF | integrate/curate unstructured data |
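The pattern search feature listed above finds values that conform to defined regular expressions. The following is a minimal sketch of that idea in Python and is not IRI's implementation. The sample rows, column names, and SSN-style pattern are hypothetical, and the pattern uses only syntax that Java and Python regular expressions share.

```python
import re

# Hypothetical pattern for US SSN-style values (NNN-NN-NNNN).
SSN_PATTERN = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")


def find_pattern_matches(rows, pattern):
    """Return (row_index, column_name, matched_value) for every match."""
    hits = []
    for index, row in enumerate(rows):
        for column, value in row.items():
            # Scan each field as text so numeric columns are covered too.
            for match in pattern.finditer(str(value)):
                hits.append((index, column, match.group()))
    return hits


# Hypothetical rows standing in for a table or flat file.
SAMPLE_ROWS = [
    {"name": "Ann Lee", "note": "SSN 123-45-6789 on file"},
    {"name": "Bob Ray", "note": "no identifier recorded"},
    {"name": "Cy Dunn", "note": "ids: 987-65-4321, 111-22-3333"},
]

if __name__ == "__main__":
    for hit in find_pattern_matches(SAMPLE_ROWS, SSN_PATTERN):
        print(hit)
```

Reporting the row and column alongside each value is a deliberate choice here: a downstream masking step needs to know where a match was found, not only that it exists.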

Data Integration

| Data Integration Feature | Details | Benefits |
| --- | --- | --- |
| Multiple source connections | manipulate and mash-up structured and unstructured sources | correlate internal and external data |
| Multiple target definitions | update and bulk-load: tables, files, pipes, procedures, and reports | single I/O, synchronized data |
| FAst extraCT (FACT) * | parallel unload Oracle, DB2, MySQL, SQL Server, Sybase, etc. | faster ETL, reorg, migration, archive |
| CoSort transform engine | resource-optimized, auto-tuned, single-pass sort, join, aggregate, etc. | relieves BI/DB tool, precludes more HW |
| CoSort (SortCL) 4GL DDL/DML | one-script / one-pass: transformation, cleansing, masking, mapping, reporting | consolidates products, simplifies metadata, and saves I/Os |
| Hadoop transforms * | seamless options for MapReduce, Spark, Spark Stream, Storm or Tez processing | unlimited scalability, commodity servers |
| Data mapping | format endian, field, record, file, and tables, add surrogate keys, etc. | supports ETL, federation, replication |
| DB DDL and loader compatibility | automated table creation and (pre-CoSorted) load script definition | faster target table creation and loading |
| Data cleansing and validation | find, filter, unify, replace, validate, regulate, standardize, synthesize | high quality data = reliable ETL and BI |

Data Migration

| Data Migration Feature | Details | Benefits |
| --- | --- | --- |
| Data type conversion | convert between alphanumeric, date/time, and multi-byte formats | speed platform migration |
| File format conversion | convert to/from fixed/delimited, LDIF, MF-ISAM and Vision, VB, XML, etc. | support application migration |
| Endian conversion | big, little and BOM recognition or change at field and file levels | ease mainframe migration |
| DB table conversion | profile existing tables, build and populate new tables with mapped data | facilitate DB vendor migration |
| Dark data structuring | search and extract pattern-matched strings from MS documents, PDF, etc. | unlock and use unstructured data |
| Data federation | process data in place (LDW) and/or send mashup results to the console or service apps | display ad hoc values and views without centralization or persistence |
| Data replication | unlocks and copies legacy data to new formats or into ETL patterns | ease the re-use of legacy data |
| JCL data redefinition | recognize and convert mainframe sort parms during sort migration | leave legacy sort software faster |
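Endian conversion at the field level, as listed above, amounts to reversing the byte order of one fixed-position binary field while leaving the rest of the record untouched. The sketch below illustrates the idea in Python under stated assumptions: the record layout (a 4-byte ASCII code followed by a 4-byte unsigned integer) and the function name are hypothetical, and this is not how Voracity itself is invoked.

```python
import struct


def swap_field_endianness(record: bytes, offset: int, size: int) -> bytes:
    """Return a copy of record with the bytes of one fixed-position field reversed."""
    field = record[offset:offset + size]
    if len(field) != size:
        raise ValueError("field extends past the end of the record")
    return record[:offset] + field[::-1] + record[offset + size:]


# Hypothetical 8-byte record: 4-byte ASCII code, then a big-endian unsigned int.
big_endian_record = b"ACCT" + struct.pack(">I", 1000)

# Convert only the numeric field (offset 4, length 4) to little-endian.
little_endian_record = swap_field_endianness(big_endian_record, 4, 4)

if __name__ == "__main__":
    print(big_endian_record.hex())
    print(little_endian_record.hex())
    print(struct.unpack("<I", little_endian_record[4:])[0])
```

Text fields are left alone because single-byte character data has no byte order; only multi-byte binary fields need the swap.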

Data Governance

| Data Governance Feature | Details | Benefits |
| --- | --- | --- |
| Metadata Management (EMM) | robust, unified data dictionary infrastructure supported in Eclipse | link, reconcile, and govern information |
| Master Data Management (MDM) | develop, define, unify, modify, bucket, use, and share master data | 360° view of customers and other data |
| Metadata asset management | master and metadata lineage, impact analysis, version control in Git or AnalytiX DS * | secure, shared, graphical data lineage |
| Data masking | static and dynamic protections in 12 categories, automatic auditing | protect and comply, secure BI/DW ops |
| Re-ID risk scoring | statistically measure and report on quasi-identifier risk, and add further generalization | comply with HIPAA EDM, etc. |
| Database subsetting | define, build, and mask subsets of production databases for testing | rapid, agile prototyping of relational data |
| Data quality | discover, de-duplicate, filter, fuzzy search, RI-check, standardize | improve MDM, ETL, analytic reliability |
| Encryption key management | built-in methods or integration with web key vault and/or HSMs | improves decryption and restoration security |
| Test data generation | parse, generate, and load synthetic DB, file, mart and report targets | faster, compliant EDW/app prototyping |
| COBIT support | data management capabilities support majority of COBIT objectives | align data management with risk control |
| DB DCAP (DAM/DAP) (IRI Chakra Max *) | available DB access and activity monitoring, blocking and auditing | fast, low-impact, cross-DB firewall |
| Compliance services | train for compliance, and work with independent, HIPAA-qualified statistician and attorneys for verification and breach insurance/defense | assess HIPAA and PCI gaps, compliance |
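Re-ID risk scoring, as listed above, measures how identifiable records are from their quasi-identifiers and then applies further generalization. One common way to reason about this is a k-anonymity-style measure: group records by their quasi-identifier values and treat each record's risk as 1 divided by the size of its group. The sketch below uses that simplified measure, which is an assumption for illustration and not IRI's scoring method or a HIPAA determination. The patient rows, field names, and 10-year age bands are hypothetical.

```python
from collections import Counter


def reid_risk(records, quasi_identifiers):
    """Score records by the size of their quasi-identifier equivalence class."""
    keys = [tuple(r[q] for q in quasi_identifiers) for r in records]
    class_sizes = Counter(keys)
    risks = [1 / class_sizes[k] for k in keys]
    return {
        "k": min(class_sizes.values()),   # smallest group size
        "max_risk": max(risks),           # risk of the most exposed record
        "avg_risk": sum(risks) / len(risks),
    }


def generalize_age(records, width=10):
    """Replace exact ages with bands such as '30-39' to enlarge the groups."""
    out = []
    for r in records:
        low = (r["age"] // width) * width
        out.append({**r, "age": f"{low}-{low + width - 1}"})
    return out


# Hypothetical records; every name and value here is illustrative only.
PATIENTS = [
    {"age": 34, "zip3": "329", "sex": "F"},
    {"age": 36, "zip3": "329", "sex": "F"},
    {"age": 41, "zip3": "329", "sex": "M"},
    {"age": 47, "zip3": "329", "sex": "M"},
]
QUASI_IDENTIFIERS = ["age", "zip3", "sex"]

if __name__ == "__main__":
    print(reid_risk(PATIENTS, QUASI_IDENTIFIERS))
    print(reid_risk(generalize_age(PATIENTS), QUASI_IDENTIFIERS))
```

With exact ages every record is unique, so the maximum risk is 1.0; after banding ages, each record shares its quasi-identifier values with one other record and the maximum risk falls to 0.5.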

BI & Analytics

| Analytic Feature | Details | Benefits |
| --- | --- | --- |
| Embedded reporting | custom detail and summary BI targets with math, transforms, masking, etc. | report while transforming |
| Change data capture | find inserts, updates, deletes, non-changes, and value deltas from data, not logs | multi-input, custom output |
| Slowly changing dimensions | update and report on changes in data using fuzzy logic | use SCD data in BI, DI |
| Clickstream and CDR support | transform, convert, mask, federate, and report on data in web log and ASN.1 formats | bypass mediation software |
| Data segmentation | custom-select and silo data during integration, masking, quality, and reporting | CDI, CRM, DLP, MDM |
| BIRT integration | CoSort (SortCL) data preparation runs at reporting time, feeding an IRI data source in memory for immediate BIRT display | faster, free visual BI in Eclipse |
| BI/analytic tool data preparation | franchise for BIRT, BOBJ, Cognos, MicroStrategy, OBIEE, QV, R, Splunk, Spotfire, Tableau, etc. | remove DI from the BI layer |
| JupiterOne * Analytics | real-time visualizations using SQL syntax and Spark processing | immediate results, bypass ETL |
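Change data capture "from data, not logs", as listed above, means comparing two versions of the data directly rather than reading a database transaction log. A minimal sketch of that comparison in Python follows. It assumes both snapshots share the same columns and a unique key, and the function name, row layout, and output shape are hypothetical rather than anything Voracity defines.

```python
def capture_changes(old_rows, new_rows, key):
    """Compare two snapshots and classify each row by its key."""
    old = {row[key]: row for row in old_rows}
    new = {row[key]: row for row in new_rows}
    changes = {"inserts": [], "updates": [], "deletes": [], "unchanged": []}

    for k, row in new.items():
        if k not in old:
            changes["inserts"].append(row)
        elif row == old[k]:
            changes["unchanged"].append(row)
        else:
            # Record (old_value, new_value) for each field that differs.
            deltas = {
                field: (old[k].get(field), value)
                for field, value in row.items()
                if old[k].get(field) != value
            }
            changes["updates"].append({"key": k, "deltas": deltas})

    for k, row in old.items():
        if k not in new:
            changes["deletes"].append(row)
    return changes


# Hypothetical snapshots of the same table at two points in time.
OLD_SNAPSHOT = [{"id": 1, "qty": 5}, {"id": 2, "qty": 7}, {"id": 3, "qty": 9}]
NEW_SNAPSHOT = [{"id": 1, "qty": 5}, {"id": 2, "qty": 8}, {"id": 4, "qty": 1}]

if __name__ == "__main__":
    for kind, rows in capture_changes(OLD_SNAPSHOT, NEW_SNAPSHOT, "id").items():
        print(kind, rows)
```

Holding both snapshots in dictionaries keeps the sketch short; for inputs too large for memory, the same classification can be done by sorting both sides on the key and merging them in one pass.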