Content area
Background
Electronic medical records contain information of value for research, but contain identifiable and often highly sensitive confidential information. Patient-identifiable information cannot in general be shared outside clinical care teams without explicit consent, but anonymisation/de-identification allows research uses of clinical data without explicit consent.
Results
This article presents CRATE (Clinical Records Anonymisation and Text Extraction), an open-source software system with separable functions: (1) it anonymises or de-identifies arbitrary relational databases, with sensitivity and precision similar to previous comparable systems; (2) it uses public secure cryptographic methods to map patient identifiers to research identifiers (pseudonyms); (3) it connects relational databases to external tools for natural language processing; (4) it provides a web front end for research and administrative functions; and (5) it supports a specific model through which patients may consent to be contacted about research.
Conclusions
Creation and management of a research database from sensitive clinical records with secure pseudonym generation, full-text indexing, and a consent-to-contact process is possible and practical using entirely free and open-source software.
Details
Psychiatry;
Internet;
Open source software;
Identification;
Medical research;
Relational data bases;
Natural language processing;
Databases;
Information retrieval;
Birth;
Extraction;
Machine learning;
Constitution;
Electronic medical records;
Welding;
Process control;
Biomedical materials;
Algorithms;
Competition;
Information processing;
Authentication;
Security;
World Wide Web;
Celery;
Software;
Language;
Information systems;
Windows (computer programs);
Simplification;
Cryptography;
Life cycle analysis;
Mental disorders;
Nets;
Learning algorithms;
Data processing;
Engineering;
Computer programs;
Translation;
Tables (data);
Privacy;
Scrub;
Drugs;
Embedding;
Computer architecture;
Colleges & universities;
Childbirth & labor;
Software industry;
Health Insurance Portability & Accountability Act 1996-US;
Process controls;
Cybersecurity