Clinical Letter Batch Anonymization with anonym.plus

Clean a whole folder of letters in one local run with steady labels.

Batch anonymization is the removal of PHI from a whole folder of letters in one run. It applies HIPAA Safe Harbor (45 CFR §164.514(b)) the same way across every file. anonym.plus works on your device, with a shared label map so one patient maps to one alias everywhere.

When this applies

A teaching corpus or research set means cleaning hundreds of letters at once, with the same patient or doctor in many. One-by-one work risks drift. A batch run keeps it even.

How anonym.plus handles it

  1. Point anonym.plus at the folder on your machine.
  2. It scans each letter for the full Safe Harbor ID set.
  3. A shared map keeps repeat people steady across the run.
  4. Review the summary and fix any low-confidence flags.
  5. Save the clean corpus on your device.

What you need to provide

PHI entity types detected

Categoryanonym.plus entity typeExample
NamesPERSONpatient across files → [PATIENT_1]
NamesPERSONrepeat doctor → [PROVIDER_1]
DatesDATE_TIMEletter dates → [DATE]
Record IDsMEDICAL_RECORD_NUMBERMRNs → [MRN_n]
LocationLOCATIONaddresses → [ADDRESS]
ContactEMAIL_ADDRESSemails → [EMAIL]

Compliance achieved

Anonymize clinical letters offline — see plans & start free →

Limitations & cautions

Mixed folders (some scanned, some native) lean on OCR for the image files, so review low-confidence flags from scans. A shared map keeps results even, but you must guard that map, since it can re-link the set if kept.

Frequently asked questions

How does batch mode keep one patient steady?

A shared map logs each person once, so the same patient or doctor maps to the same alias in every file across the folder.

Can the batch mix PDFs, DOCX, and scans?

Yes. Mixed types work. Scanned letters are read with local OCR before the check.

Is there a cloud quota on size?

No. Work is local, so speed depends on your hardware, not a cloud plan.