Dataset batch redaction is the removal of personal data from many capital-markets files in one run under SEC Reg S-P. The privacy rule (17 CFR 248) governs how a firm safeguards customer data. anonym.plus processes up to 20 files at a time on your machine, with a shared map so one party maps to one alias.
When this applies
A deal archive holds rosters, blotters, and statements that name the same clients. One-by-one work risks drift, so a batch run applies the rule evenly.
How anonym.plus handles it
- Point anonym.plus at the archive folder on your machine.
- It scans each file for names, IDs, and account fields.
- Local OCR reads any scanned pages in the set.
- A shared map keeps repeat clients steady across files.
- Review the summary and fix low-confidence flags.
- Save the clean set locally.
What you need to provide
- A folder of datasets (CSV, PDF, DOCX, mixed).
- The shared label map turned on for steady results.
- An operator (Mask suits structured fields).
PII & financial identifiers detected
| Category | anonym.plus entity type | Example |
|---|---|---|
| Names | PERSON | client across files → [CLIENT_1] |
| Identifiers | US_SSN | tax SSNs → [SSN] |
| Financial | US_BANK_NUMBER | accounts → [ACCOUNT] |
| Financial | IBAN_CODE | wire IBANs → [IBAN] |
| Contact | EMAIL_ADDRESS | emails → [EMAIL] |
| Financial | MONEY | amounts → [AMOUNT] |
Compliance achieved
- Supports the safeguards rule in SEC Reg S-P (17 CFR 248).
- The shared map keeps results steady across the set.
- Whole-folder work stays offline — nothing uploads.
Anonymize capital-markets datasets offline — see plans & start free →
Limitations & cautions
A mixed folder of scans and native files leans on OCR for the images, so review low-confidence flags. The shared map can re-link the set if kept; turn it off when you need true anonymity.
Frequently asked questions
How does batch mode keep one client steady?
A shared map logs each person once, so the same client maps to the same alias in every file across the folder.
How many files can one run handle?
Up to 20 files per batch, all processed locally with OCR for any scanned pages.
Does batch work upload anything?
No. The whole run is offline, so customer data stays on your machine.