Map Once, Deploy Anywhere

Use CoSort or Hadoop: Same Job, Metadata, GUI

When it comes to processing big data, most people think they need another IT fabric like Hadoop or Teradata, an in-memory or columnar database like SAP HANA or Vertica, a DB or ELT appliance like Exadata or Netezza, or a complex ETL tool like Informatica or Ab Initio. Do you have the time, money and expertise to take that on?


What if there were a proven, much more affordable, high-performance data processing and governance platform built on Eclipse that exploited existing file system and HDFS resources (data and engines) interchangeably?

IRI Insights Success Cover Story Award


Click the Cover Stoy to see the seamless processing choices that only IRI Voracity delivers for big data transformation, masking, and generation.

With Hadoop

IRI Voracity leverages the performance, scalability, load balancing, and automatic failover capabilities of MapReduce 2 (MR2), Spark, Spark Stream, Storm, or Tez. Voracity runs most CoSort (SortCL) jobs in these engines based on availability and need. Voracity works in Cloudera, HortonWorks, and MapR distributions. IRI will also provide its own Hadoop distribution on-premise or in the cloud, and later in a hardware appliance that includes everything. This article shows how to run Voracity jobs in Hadoop.

Without Hadoop

IRI CoSort, the proven big data processing alternative to Hadoop, provides fast, intuitive, inexpensive, and non-disruptive processes. It removes the skills gap and support costs around Hadoop, and it does not require the time, money, or manpower to leverage big data. CoSort is a low-cost, low-impact, and low-risk option essential for small and medium-sized business or enterprise line of business teams that love even its multi-terabyte performance.

Whether your data sources are in a standard Unix, Linux or Windows file system, in HDFS, or managed in the proprietary systems above, you can manage it all faster and cheaper in Voracity using a common metadata and free Eclipse IDE called IRI Workbench.

  • What is Hadoop?

    Hadoop is an increasingly popular computing environment for distributed processing that business can use to analyze and store huge amounts of data.

    Read the Article.

  • A Big Data Quandry

    Big data volumes are growing exponentially, and simply throwing hardware at it isn't a complete or reliable long-term solution. IRI's proven strategies and software, however, are.

    Read the Article.

  • When to Use Hadoop?

    Hadoop isn't a one-size-fits-all framework. You need to know when and how to use it. Voracity makes short work of Hadoop job design and deployment when you need it.

    Read the Article.

With Voracity, it's no longer a matter of homogeneous data processed heterogeneously, or vice versa. It's about having a seamless, unified, metadata-driven enterprise information architecture ... one that gives you control over heterogeneous data sources and processing engines ... and one that meets changing data integration, governance, and analytic needs.