Data Blurring

Generalize or Add Noise to PII

Blurring and Generalization


Quasi-identifying values like age and date of birth, as well as descriptors like occupation and marital status, can all be used to re-identify people if there are enough of these attributes in the data set and/or they can be joined to a superset population with similar values.


For this reason, your jobs in the IRI FieldShield data masking product (or IRI Voracity data management platform) can apply one or more additional techniques to obfuscate the data while still keeping it accurate enough for research or marketing purposes. Numeric blurring functions add random noise to age and date values within the ranges you specify.
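
To make the idea of numeric blurring concrete, here is a minimal Python sketch. It is not FieldShield job syntax; the function names (blur_age, blur_date) and the default ranges are assumptions chosen only for illustration. Each value is shifted by a random amount that stays inside a caller-specified range.

```python
import random
from datetime import date, timedelta

# Hypothetical helpers for illustration only; not part of any IRI product API.

def blur_age(age, max_shift=3, rng=random):
    """Shift an age by a random whole number of years in [-max_shift, max_shift]."""
    # Clamp at zero so the blurred age is never negative.
    return max(0, age + rng.randint(-max_shift, max_shift))

def blur_date(d, max_days=30, rng=random):
    """Shift a date by a random whole number of days in [-max_days, max_days]."""
    return d + timedelta(days=rng.randint(-max_days, max_days))

if __name__ == "__main__":
    rng = random.Random(42)  # seeded so repeated runs give the same noise
    print(blur_age(37, max_shift=3, rng=rng))
    print(blur_date(date(1984, 6, 15), max_days=30, rng=rng))
```

A narrower range keeps the data more accurate for analysis, while a wider range makes the original value harder to recover, so the range is the main tuning choice.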


In the example below, specific ages are bucketed into decade groups, multiple marital status attributes are combined into two broader categories in a defined condition, educational attainments are simplified through a new set lookup file, and all occupations are explicitly redacted in place.
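
The four generalization steps described above can also be sketched in plain Python. This is an illustrative approximation, not the FieldShield job itself: the category labels, the EDUCATION_LOOKUP dictionary (standing in for a set lookup file), and the field names are all assumptions.

```python
# Hypothetical stand-in for a set lookup file: detailed value -> broader value.
EDUCATION_LOOKUP = {
    "9th grade": "Secondary",
    "High school graduate": "Secondary",
    "Bachelors": "Higher education",
    "Masters": "Higher education",
    "Doctorate": "Higher education",
}

def age_to_decade(age):
    """Bucket an exact age into its decade group, e.g. 37 -> '30-39'."""
    low = (age // 10) * 10
    return f"{low}-{low + 9}"

def generalize_marital(status):
    """Collapse detailed marital statuses into two assumed broad categories."""
    if status.strip().lower() == "never married":
        return "Never married"
    return "Ever married"

def redact(value):
    """Replace every character with an asterisk, preserving the length."""
    return "*" * len(value)

def generalize_record(rec):
    """Apply all four generalizations to one record (a dict of field values)."""
    return {
        "age": age_to_decade(rec["age"]),
        "marital_status": generalize_marital(rec["marital_status"]),
        # Values missing from the lookup fall back to a catch-all category.
        "education": EDUCATION_LOOKUP.get(rec["education"], "Other"),
        "occupation": redact(rec["occupation"]),
    }

if __name__ == "__main__":
    record = {
        "age": 37,
        "marital_status": "Divorced",
        "education": "Masters",
        "occupation": "Engineer",
    }
    print(generalize_record(record))
```

Each step trades detail for anonymity: the more records that share a generalized value, the less distinctive that value is as a quasi-identifier.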

The new result set can now be re-run through the risk scoring wizard to produce another determination of re-identification risk, this time based on the less distinct quasi-identifying attributes.