Batch anonymization is the removal of PII from a whole folder of deal documents in one run. It applies GDPR Recital 26 the same way to each one. anonym.plus works on your device, with a shared map so one party maps to one label everywhere.
When this applies
A full deal space means cleaning hundreds of items at once, with the same staff or counterparty in many. One-by-one work risks drift. A batch run keeps it even.
How anonym.plus handles it
- Point anonym.plus at the folder on your machine.
- It scans each item for names, IDs, and account data.
- A shared map keeps repeat parties steady across the run.
- Review the summary and fix any low-confidence flags.
- Save the clean set on your device.
What you need to provide
- A folder of documents (PDF, DOCX, XLSX, or mixed).
- The shared label map turned on for steady results.
- An operator (Replace works well for a corpus).
PII entity types detected
| Category | anonym.plus entity type | Example |
|---|---|---|
| Names | PERSON | party across files → [PARTY_1] |
| Org | ORGANIZATION | repeat firm → [ENTITY_1] |
| Contact | EMAIL_ADDRESS | emails → [EMAIL] |
| Finance | IBAN_CODE | accounts → [ACCOUNT_n] |
| Identifiers | NATIONAL_ID | tax ids → [ID_n] |
| Location | LOCATION | addresses → [ADDRESS] |
Compliance achieved
- Applies GDPR Recital 26 the same way to every item.
- The shared map keeps results steady across the run.
- Whole-folder work stays offline — nothing uploaded. Batch up to 20 at a time.
Anonymize diligence data sets offline — see plans & start free →
Limitations & cautions
Mixed folders (some scanned, some native) lean on OCR for image pages, so review low-confidence flags from scans. The shared map keeps results even, but you must guard it, since it can re-link the set if kept.
Frequently asked questions
How does batch mode keep one party steady?
A shared map logs each party once, so the same person or firm maps to the same label in every document across the folder.
How many items can one batch hold?
A batch runs up to 20 at a time. Larger spaces split into runs, each on your device.
Can the batch mix PDFs, DOCX, and scans?
Yes. Mixed types work. Scanned pages are read with local OCR before the check.