Diligence Dataset Batch Anonymization with anonym.plus

Clean a whole folder of deal documents in one local run with steady labels.

Batch anonymization is the removal of PII from a whole folder of deal documents in one run. It applies GDPR Recital 26 the same way to each one. anonym.plus works on your device, with a shared map so one party maps to one label everywhere.

When this applies

A full deal space means cleaning hundreds of items at once, with the same staff or counterparty in many. One-by-one work risks drift. A batch run keeps it even.

How anonym.plus handles it

  1. Point anonym.plus at the folder on your machine.
  2. It scans each item for names, IDs, and account data.
  3. A shared map keeps repeat parties steady across the run.
  4. Review the summary and fix any low-confidence flags.
  5. Save the clean set on your device.

What you need to provide

PII entity types detected

Categoryanonym.plus entity typeExample
NamesPERSONparty across files → [PARTY_1]
OrgORGANIZATIONrepeat firm → [ENTITY_1]
ContactEMAIL_ADDRESSemails → [EMAIL]
FinanceIBAN_CODEaccounts → [ACCOUNT_n]
IdentifiersNATIONAL_IDtax ids → [ID_n]
LocationLOCATIONaddresses → [ADDRESS]

Compliance achieved

Anonymize diligence data sets offline — see plans & start free →

Limitations & cautions

Mixed folders (some scanned, some native) lean on OCR for image pages, so review low-confidence flags from scans. The shared map keeps results even, but you must guard it, since it can re-link the set if kept.

Frequently asked questions

How does batch mode keep one party steady?

A shared map logs each party once, so the same person or firm maps to the same label in every document across the folder.

How many items can one batch hold?

A batch runs up to 20 at a time. Larger spaces split into runs, each on your device.

Can the batch mix PDFs, DOCX, and scans?

Yes. Mixed types work. Scanned pages are read with local OCR before the check.