Research dataset anonymization is the removal of participant identifiers under GDPR Art. 89 safeguards. anonym.plus runs on your own device. The measured variables stay usable while the people behind them stop being named.
When this applies
A study group shares a file with collaborators across borders. Direct names, emails, and rare birth dates must be hidden first.
How anonym.plus handles it
- Load the file (CSV, XLSX, or DOCX) into anonym.plus on your device.
- The tool scans columns and notes for direct identifiers.
- Local OCR reads any scanned questionnaire you include.
- Confirm the flagged names, emails, and dates.
- Replace each with a steady pseudonym across the file.
- Save the safeguarded copy locally with no network call.
What you need to provide
- The file (CSV, XLSX, DOCX, or scan).
- An operator: Replace for pseudonyms, Redact to drop a field.
- Optional: a pseudonym map kept apart for re-linking.
PHI entity types detected
| Category | anonym.plus entity type | Example |
|---|---|---|
| Names | PERSON | Ingrid Larsen → [PARTICIPANT_2] |
| EMAIL_ADDRESS | i.larsen@example.no → [EMAIL] | |
| Birth date | DATE_TIME | born 04/04/1939 → [BIRTH_YEAR] |
| Location | LOCATION | Tromsø, Norway → [REGION] |
| Phone | PHONE_NUMBER | +47 555 01 902 → [PHONE] |
| Free-text ID | ID | enrol EN-3382 → [ENROL_ID] |
Compliance achieved
- Supports the safeguards expected under GDPR Art. 89.
- Runs offline, so the tool itself needs no BAA.
- On-device AES-256-GCM protects the working copy.
- Reaches GDPR Recital 26 scope once no one can be identified.
Anonymize research datasets offline — see plans & start free →
Limitations & cautions
Art. 89 allows pseudonymised data under safeguards, which is not the same as full anonymisation. If you keep a re-linking map, the file is still personal data under the GDPR. Store that map apart and judge whether rare combinations remain.
Frequently asked questions
What does GDPR Art. 89 cover?
It sets safeguards for processing personal data for scientific study, such as pseudonymisation and data minimisation. Removing direct identifiers is a core safeguard before you share a file with collaborators.
Is pseudonymised data anonymous?
No. If a re-linking key exists, the file is still personal data under the GDPR. Full anonymisation (Recital 26) means no one can be identified by any means reasonably likely to be used.
Where should the key map live?
Apart from the shared file, on a protected store only the lead can reach. anonym.plus can produce a pseudonym map you hold separately.