Healthcare, Pharmaceuticals, R&D


Big Data Transformation & PHI De-ID Solutions

Big Data Challenges in Healthcare


There are several big data processing and data-centric security challenges in biomedicine and healthcare today, including:

  • health insurance companies running promotional and fraud detection data warehouses which need fast integration, cleansing, reformatting, and analysis of claim data
  • genomic data transformation and analytics
  • pharmaceutical companies manipulating and reporting on sales, manufacturing, and supply chain data
  • HIPAA-covered entities and business associates like marketers or processors of patient and prescription databases who need to find, de-ID, and audit Protected Health Information (PHI) to comply with the Safe Harbor Rule, and/or score the risk of re-identification to comply with the Expert Determination Method rule
  • DBAs and developers requiring safe, intelligent test data

Data Preparation Challenges in Healthcare

Learn more about these data preparation challenges specific to the healthcare industry in this Q3'2023 podcast from Bloor Research.

IRI software applicable to data-driven use cases in the healthcare sector includes:

  • IRI Voracity - to discover, integrate, govern (mask, risk-score, prototype, track), and analyze ePHI, leveraging all of the functions in these products which are also available standalone:
  • IRI FACT (Fast Extract) - to rapidly unload Oracle, MS SQL, and other DBs, and run in Voracity ETL jobs
  • IRI CoSort (SortCL program) - to optimize and combine transformation and reporting on flat and semi-structured data
  • IRI FieldShield - to redact, encrypt, pseudonymize, and otherwise de-ID, risk-score and anonymize PHI/PII in RDBs and flat files
  • IRI CellShield - to discover, then mask, encrypt, and pseudonymize (and audit) PHI in Excel® spreadsheets
  • IRI DarkShield - to discover, deliver, delete and mask 'hidden' PHI not only in structured sources, but also in semi-structured and unstructured text, HL7/X12, DICOM, document, and NoSQLsources
  • IRI RowGen - to populate test DBs and files with production-quality synthetic data (without requiring real data) or masked DB subsets in tables and files, plus HL7, X12, FHIR, etc.

IRI also works with experts in healthcare analytics, like Scalable Healthcare, and teaches HIPAA SH/EDM compliance with statisticians and attorneys in courses like this. IRI also integrates and partners with leading test data management portals and databse virtualization solutions.


You can learn more about the use of IRI Voracity platform solutions above in healthcare from the 2023 Bloor Research InContext report.

IRI customers in this broad category include:

  • Accenture
  • AIM Health
  • Aon Hewitt
  • Appistry
  • Assubel
  • Aventis
  • Aviva
  • Blue Cross Blue Shield
  • BUPA
  • Cigna
  • Codman Group
  • Esai Pharmaceuticals
  • Excelus
  • General Dynamics (ViPS)
  • Geisinger Health
  • Genome Institute of Singapore
  • Health Alliance Plan
  • Health Canada
  • Highmark Health
  • Lexis/Nexis (EDIWatch)
  • Mayo Clinic
  • McKesson
  • MedicX
  • Nevada Health
  • Northwell Health
  • Physicians Mutual
  • Pinnacol Assurance
  • Regence Group
  • Sato Pharmaceuticals
  • Segal Health Benefits
  • Singapore Ministry of Health
  • Texas Tech Health Sciences
  • United Healthcare
  • Unlimited Care
  • Wellpoint