Abstract

Background

Electronic medical records contain information of value for research, but also identifiable and often highly sensitive confidential information. In general, patient-identifiable information cannot be shared outside clinical care teams without explicit consent, but anonymisation/de-identification allows clinical data to be used for research without it.

Results

This article presents CRATE (Clinical Records Anonymisation and Text Extraction), an open-source software system with separable functions: (1) it anonymises or de-identifies arbitrary relational databases, with sensitivity and precision similar to previous comparable systems; (2) it uses public secure cryptographic methods to map patient identifiers to research identifiers (pseudonyms); (3) it connects relational databases to external tools for natural language processing; (4) it provides a web front end for research and administrative functions; and (5) it supports a specific model through which patients may consent to be contacted about research.
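
As an illustration of function (2), the sketch below derives a research identifier from a patient identifier with a keyed hash (HMAC-SHA256 from the Python standard library). This is one common approach and is an assumption here: the abstract does not say which cryptographic method CRATE uses, and the function name `pseudonymise` and the example key are hypothetical.

```python
import hashlib
import hmac


def pseudonymise(patient_id: str, secret_key: bytes) -> str:
    """Map a patient identifier to a research identifier (pseudonym).

    Hypothetical helper, not CRATE's API. HMAC-SHA256 is an assumed choice:
    the same identifier and key always give the same pseudonym, and the
    mapping cannot be recomputed without the key.
    """
    digest = hmac.new(secret_key, patient_id.encode("utf-8"), hashlib.sha256)
    return digest.hexdigest()


# Hypothetical key; in practice it would be stored outside the research database.
key = b"example-secret-key"
research_id = pseudonymise("1234567890", key)
print(research_id)
```

Because the key never enters the research database, researchers see a stable pseudonym per patient but cannot recompute the mapping themselves.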

Conclusions

Creation and management of a research database from sensitive clinical records with secure pseudonym generation, full-text indexing, and a consent-to-contact process is possible and practical using entirely free and open-source software.

Details

Title
Clinical records anonymisation and text extraction (CRATE): an open-source software system
Volume
17
Publication year
2017
Publication date
2017
Publisher
Springer Nature B.V.
Place of publication
London
Country of publication
Netherlands
e-ISSN
1472-6947
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
ProQuest document ID
1893831324
Document URL
https://www.proquest.com/scholarly-journals/clinical-records-anonymisation-text-extraction/docview/1893831324/se-2?accountid=208611
Copyright
Copyright BioMed Central 2017
Last updated
2024-06-26
Database
ProQuest One Academic