Audit-dataset de-identification is the removal of personal data, including the special category health data covered by UK GDPR Art. 9 and the DPA 2018, from a compliance extract. anonym.plus runs on your own device. The findings stay testable, but the rows no longer name patients or staff.
When this applies
A compliance team samples charts and access entries to test a control. The sample names patients and the workers who touched each chart, and both must be removed before the sample is shared.
How anonym.plus handles it
- Point anonym.plus at the sample on your server.
- It scans ID columns and any free-text fields.
- It assigns steady labels, so the same person gets the same label in every joined row.
- Review the summary and adjust the column rules.
- Swap each identifier for its label, and shift dates so that the gaps between events are kept.
- Save the clean sample. Source rows stay local.
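The two ideas in the steps above, steady labels and gap-preserving date shifts, can be sketched in a few lines of Python. This is an illustration only, not anonym.plus's implementation: the `SteadyLabeller` class, the `shift_date` helper, the column names, and the 37-day offset are all hypothetical.

```python
from datetime import date, timedelta


class SteadyLabeller:
    """Hands out one stable label per distinct value, e.g. [PATIENT_1]."""

    def __init__(self, prefix):
        self.prefix = prefix
        self.seen = {}  # original value -> label already issued

    def label(self, value):
        # The same input always returns the same label, so joins survive.
        if value not in self.seen:
            self.seen[value] = f"[{self.prefix}_{len(self.seen) + 1}]"
        return self.seen[value]


def shift_date(d, offset_days):
    """Move a date by a fixed number of days."""
    return d + timedelta(days=offset_days)


OFFSET_DAYS = -37  # hypothetical: one offset for the whole sample
patients = SteadyLabeller("PATIENT")

# Made-up rows standing in for an access-log sample.
rows = [
    {"patient": "Ann Example", "accessed": date(2024, 3, 1)},
    {"patient": "Bob Sample", "accessed": date(2024, 3, 4)},
    {"patient": "Ann Example", "accessed": date(2024, 3, 9)},
]

clean = [
    {
        "patient": patients.label(r["patient"]),
        "accessed": shift_date(r["accessed"], OFFSET_DAYS),
    }
    for r in rows
]
```

Because every date moves by the same offset, the eight days between the first and third access are unchanged, and both rows for the same patient carry the same label.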
What you need to provide
- The sample (CSV, JSON, or a file bundle).
- A column map for known ID fields.
- Your choice of method: use Replace with a steady map to keep joins.
Patient data entity types detected
| Category | anonym.plus entity type | Example |
|---|---|---|
| Patient | PERSON | subject → [PATIENT_n] |
| Staff | PERSON | reviewer → [AUDITOR_n] |
| Staff ID | ID | login bweiss → [USERNAME] |
| Dates | DATE_TIME | sampled → shifted [DATE] |
| Record IDs | MEDICAL_RECORD_NUMBER | nhs_number field → [NHS_NUMBER_n] |
| Network | IP_ADDRESS | 10.2.7.40 → [IP] |
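To show what detecting two of these entity types in a free-text note can involve, here is a minimal Python sketch. It is not anonym.plus's detector: the patterns, the function names, and the unnumbered `[NHS_NUMBER]` placeholder are assumptions made for the example. It validates IPv4 candidates with the standard `ipaddress` module and ten-digit candidates with the NHS number Modulus 11 check digit.

```python
import ipaddress
import re

# Candidate patterns only; each hit is validated before it is replaced.
IP_RE = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")
NHS_RE = re.compile(r"\b\d{3}[ -]?\d{3}[ -]?\d{4}\b")


def is_ipv4(text):
    """True if the text parses as an IPv4 address."""
    try:
        ipaddress.IPv4Address(text)
        return True
    except ValueError:
        return False


def nhs_checksum_ok(candidate):
    """Apply the Modulus 11 check used for NHS numbers."""
    digits = [int(c) for c in candidate if c.isdigit()]
    if len(digits) != 10:
        return False
    # Weights 10 down to 2 over the first nine digits.
    total = sum(d * w for d, w in zip(digits[:9], range(10, 1, -1)))
    check = 11 - (total % 11)
    if check == 11:
        check = 0
    return check != 10 and check == digits[9]


def redact(note):
    """Replace validated IPs and NHS numbers in a note with placeholders."""
    note = IP_RE.sub(
        lambda m: "[IP]" if is_ipv4(m.group()) else m.group(), note
    )
    note = NHS_RE.sub(
        lambda m: "[NHS_NUMBER]" if nhs_checksum_ok(m.group()) else m.group(),
        note,
    )
    return note
```

Validating each match before replacing it cuts false positives: a ten-digit reference that fails the check digit is left alone. A real tool would also need name and username detection, which simple patterns like these do not cover.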
Compliance achieved
- Strips personal data under UK GDPR Art. 9 & DPA 2018.
- Runs offline, so no supplier contract is needed.
- Keeps row links steady, so the control stays testable.
Limitations & cautions
A compliance sample mixes tidy columns with note fields. The columns are easy to map; notes need the same care as any chart. Test a small slice first, and confirm that date-shifting keeps the gaps your test relies on.
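One way to run the date-shifting check on a small slice is to compare the gaps between consecutive events before and after the shift. The sketch below assumes you can load the original and shifted dates as two lists in the same row order; the helper names and the sample dates are made up for the example.

```python
from datetime import date, timedelta


def gaps_in_days(dates):
    """Day counts between each pair of consecutive dates."""
    return [(b - a).days for a, b in zip(dates, dates[1:])]


def gaps_preserved(original, shifted):
    """True if both lists have the same length and the same gaps."""
    return (
        len(original) == len(shifted)
        and gaps_in_days(original) == gaps_in_days(shifted)
    )


original = [date(2024, 1, 30), date(2024, 2, 2), date(2024, 3, 1)]

# Every row moved by the same offset: gaps survive.
uniform = [d + timedelta(days=12) for d in original]

# Each row moved by a different offset: gaps are distorted.
uneven = [d + timedelta(days=n) for d, n in zip(original, (12, 5, 20))]

uniform_ok = gaps_preserved(original, uniform)
uneven_ok = gaps_preserved(original, uneven)
```

If the check fails on the slice, the control test that depends on those intervals (for example, time between admission and first access) should not be run on the shifted data until the shift settings are corrected.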
Frequently asked questions
Does an audit sample hold personal data?
Yes. It names patients in the charts and the staff who handled them. Both sets of identifiers are replaced so the findings can be shared without exposing people.
Can the rows still be tested after the swap?
Yes. A steady map keeps joins intact, so the control can be checked on the clean data.
Are staff logins and IPs handled?
Yes. Usernames, IPs, and timestamps in access entries are flagged alongside patient identifiers.