Clinical Trial Dataset Anonymisation

Healthcare Solution · Clinical Research & Trials · MHRA / ICH E6(R2)

Trial dataset anonymisation is the removal of participant identifiers before a file is shared. It supports MHRA clinical-trial transparency requirements and ICH E6(R2). anonym.plus runs offline and keeps the measured values readable.

When this applies

The MHRA asks a sponsor to share participant-level results. Names, free-text fields, and rare dates must be hidden first.

How anonym.plus handles it

Load the file (CSV, XLSX, PDF, or DOCX) into anonym.plus.
The tool scans columns and free text for direct identifiers.
Local OCR pulls text from any scanned supporting page.
Confirm the flagged participant names, dates, and places.
Replace each one with a steady token across the whole file.
Save the cleaned copy on your device with no network call.

What you need to provide

The file (CSV, XLSX, PDF, DOCX, or scan).
An operator: Replace for tokens, Redact to drop a field.
Optional: a token map for consistent labels across exports.

Patient data entity types detected

Category	anonym.plus entity type	Example
Names	PERSON	James O’Brien → [PARTICIPANT_1]
Event dates	DATE_TIME	Randomised 02/02/2026 → [DATE]
Location	LOCATION	Manchester, England → [REGION]
Email	EMAIL_ADDRESS	j.obrien@example.co.uk → [EMAIL]
Free-text ID	ID	Screening SCR-0091 → [SCREEN_ID]
NHS number	MEDICAL_RECORD_NUMBER	NHS 512 345 6789 → [LOCAL_ID]

Compliance achieved

Supports transparency under MHRA / ICH E6(R2).
Stays offline, so no cloud data-processor contract is needed.
On-device AES-256-GCM guards the working copy.
Aligns with UK GDPR Recital 26 when the result no longer identifies anyone.

Anonymise clinical trial datasets offline — see plans & start free →

Limitations & cautions

MHRA transparency expects a risk assessment, not just field removal. The tool strips direct identifiers and flags rare values like an unusual age. You still judge whether quasi-identifiers in combination could re-identify a participant before release.

Frequently asked questions

What does MHRA transparency require for trial data?

The MHRA expects sponsors to publish or share participant-level data with identifiers removed and a documented justification of the method. Removing direct identifiers is the first step toward that submission.

Can it process spreadsheet columns?

Yes. Load a CSV or XLSX and the tool scans both column values and free text. It applies the same token to a repeated name across every row.

Is the published file still useful for analysis?

Yes. Numeric outcomes and timing offsets remain. Only direct identifiers are swapped, so the file keeps its scientific value.

Clinical Trial Dataset Anonymisation with anonym.plus