Capital Markets Dataset Batch Redaction with anonym.plus

Clean a whole folder of deal datasets in one local run with steady labels.

Dataset batch redaction is the removal of personal data from many capital-markets files in one run under SEC Reg S-P. The privacy rule (17 CFR 248) governs how a firm safeguards customer data. anonym.plus processes up to 20 files at a time on your machine, with a shared map so one party maps to one alias.

When this applies

A deal archive holds rosters, blotters, and statements that name the same clients. One-by-one work risks drift, so a batch run applies the rule evenly.

How anonym.plus handles it

  1. Point anonym.plus at the archive folder on your machine.
  2. It scans each file for names, IDs, and account fields.
  3. Local OCR reads any scanned pages in the set.
  4. A shared map keeps repeat clients steady across files.
  5. Review the summary and fix low-confidence flags.
  6. Save the clean set locally.

What you need to provide

PII & financial identifiers detected

Categoryanonym.plus entity typeExample
NamesPERSONclient across files → [CLIENT_1]
IdentifiersUS_SSNtax SSNs → [SSN]
FinancialUS_BANK_NUMBERaccounts → [ACCOUNT]
FinancialIBAN_CODEwire IBANs → [IBAN]
ContactEMAIL_ADDRESSemails → [EMAIL]
FinancialMONEYamounts → [AMOUNT]

Compliance achieved

Anonymize capital-markets datasets offline — see plans & start free →

Limitations & cautions

A mixed folder of scans and native files leans on OCR for the images, so review low-confidence flags. The shared map can re-link the set if kept; turn it off when you need true anonymity.

Frequently asked questions

How does batch mode keep one client steady?

A shared map logs each person once, so the same client maps to the same alias in every file across the folder.

How many files can one run handle?

Up to 20 files per batch, all processed locally with OCR for any scanned pages.

Does batch work upload anything?

No. The whole run is offline, so customer data stays on your machine.