Patient privacy assured by electronic censor

Patient records that are to be shared within the research community must have any identifying information removed. Manual removal of identifying information is prohibitively expensive and time consuming. Considerable research by many investigators has focussed on developing automated techniques for “de-identifying” medical records.

A team from the Massachusetts Institute of Technology (MIT) funded by the National Institutes of Health (NIH) aimed to solve this problem, pointing out that: “Text-based patient medical records are a vital resource in research. The expense of manual de-identification, coupled with the fact that it is time-consuming and prone to error, necessitates automatic methods for large-scale de-identification.”

The MIT team tested their censoring software on a meticulously hand-annotated database of 1836 nursing notes (a total of 296,400 words). According to the authors, “The software successfully deleted more than 94% of the confidential information, while wrongly deleting only 0.2% of the useful content. This is significantly better than one expert working alone, at least as good as two trained medical professionals checking each other’s work and many, many times faster than either.”

The MIT team is also providing access to the fully-scrubbed annotated data together with the software to allow others to improve their systems, and to allow the software to be adapted to other data types that may exhibit different qualities.

Media Contact

Graeme Baldwin alfa

More Information:

http://www.biomedcentral.com

All latest news from the category: Information Technology

Here you can find a summary of innovations in the fields of information and data processing and up-to-date developments on IT equipment and hardware.

This area covers topics such as IT services, IT architectures, IT management and telecommunications.

Back to home

Comments (0)

Write a comment

Newest articles

A universal framework for spatial biology

SpatialData is a freely accessible tool to unify and integrate data from different omics technologies accounting for spatial information, which can provide holistic insights into health and disease. Biological processes…

How complex biological processes arise

A $20 million grant from the U.S. National Science Foundation (NSF) will support the establishment and operation of the National Synthesis Center for Emergence in the Molecular and Cellular Sciences (NCEMS) at…

Airborne single-photon lidar system achieves high-resolution 3D imaging

Compact, low-power system opens doors for photon-efficient drone and satellite-based environmental monitoring and mapping. Researchers have developed a compact and lightweight single-photon airborne lidar system that can acquire high-resolution 3D…

Partners & Sponsors