Audit-dataset de-identification is the removal of all 18 HIPAA Safe Harbor IDs (45 CFR §164.514(b)) from a compliance extract. anonym.plus runs on your own device. The findings stay testable, but the rows no longer name patients or staff.
When this applies
A compliance team samples charts and access entries to test a control. The sample names patients and the workers who touched each chart, and that must come out.
How anonym.plus handles it
- Point anonym.plus at the sample on your server.
- It scans ID columns and any free-text fields.
- Steady labels keep links across joined rows.
- Review the summary and adjust the column rules.
- Swap each ID, shifting dates to keep the gaps.
- Save the clean sample. Source rows stay local.
What you need to provide
- The sample (CSV, JSON, or a file bundle).
- A column map for known ID fields.
- Replace with a steady map to keep joins.
PHI entity types detected
| Category | anonym.plus entity type | Example |
|---|---|---|
| Patient | PERSON | subject → [PATIENT_n] |
| Staff | PERSON | reviewer → [AUDITOR_n] |
| Staff ID | ID | login bweiss → [USERNAME] |
| Dates | DATE_TIME | sampled → shifted [DATE] |
| Record IDs | MEDICAL_RECORD_NUMBER | mrn field → [MRN_n] |
| Network | IP_ADDRESS | 10.2.7.40 → [IP] |
Compliance achieved
- Strips all 18 ID classes for HIPAA Safe Harbor (45 CFR §164.514(b)).
- Runs offline, so the tool itself needs no BAA.
- Keeps row links steady, so the control stays testable.
Anonymize audit datasets offline — see plans & start free →
Limitations & cautions
A compliance sample mixes tidy columns with note fields. The columns are easy to map; notes need the same care as any chart. Test a small slice first, and confirm that date-shifting keeps the gaps your test relies on.
Frequently asked questions
Does an audit sample hold PHI?
Yes. It names patients in the charts and the staff who handled them. Both are cleared so the findings can be shared without exposing people.
Can the rows still be tested after the swap?
Yes. A steady map keeps joins intact, so the control can be checked on the clean data.
Are staff logins and IPs handled?
Yes. Usernames, IPs, and timestamps in access entries are flagged alongside patient IDs.