Onboarding batch redaction is the removal of identifiers from a whole folder in one run, guided by GDPR Recital 26. That recital sets when data no longer points to a person. anonym.plus processes up to 20 files at a time on your device, with a shared map so one worker maps to one alias everywhere.
When this applies
A joiner's folder holds forms, IDs, and checklists that name the same person. One-by-one work risks drift, so a batch run cleans the whole set evenly.
How anonym.plus handles it
- Point anonym.plus at the joiner folder on your machine.
- It scans each file for the identifier set.
- Local OCR reads any scanned pages in the set.
- A shared map keeps the worker steady across files.
- Review the summary and fix low-confidence flags.
- Save the clean set locally.
What you need to provide
- A folder of onboarding files (PDF, DOCX, mixed).
- The shared label map turned on for steady results.
- An operator (Replace works well for a joiner set).
PII entity types detected
| Category | anonym.plus entity type | Example |
|---|---|---|
| Names | PERSON | worker across files → [WORKER_1] |
| Identifiers | US_SSN | SSNs → [SSN] |
| Identifiers | NATIONAL_ID | staff IDs → [ID] |
| Contact | EMAIL_ADDRESS | emails → [EMAIL] |
| Financial | US_BANK_NUMBER | accounts → [ACCOUNT] |
| Dates | DATE_TIME | birth dates → [DOB] |
Compliance achieved
- Aligns with the anonymity test in GDPR Recital 26 across the set.
- The shared map keeps results steady file to file.
- Whole-folder work stays offline — nothing leaves your machine.
Anonymize onboarding file sets offline — see plans & start free →
Limitations & cautions
A mixed folder of scans and native files leans on OCR for the images, so review low-confidence flags. The shared map can re-link the set if kept; turn it off when you need true anonymity.
Frequently asked questions
How does batch mode keep one worker steady?
A shared map logs each person once, so the same worker maps to the same alias in every file across the folder.
How many files can one run handle?
Up to 20 files per batch, all processed locally with OCR for any scanned pages.
Does batch work upload anything?
No. The whole run is offline, so onboarding data stays on your machine.