AI method uses transformer models to study human cells

A new algorithm developed by researchers in Carnegie Mellon University's Computational Biology Department called subcellular spatial transcriptomics cell segmentation (SCS) harnesses AI and advanced deep neural networks to adaptively identify cells and their constituent parts.
Credit: Carnegie Mellon University

Researchers in Carnegie Mellon University’s School of Computer Science have developed a method that uses artificial intelligence to augment how cells are studied and could help scientists better understand and eventually treat disease.

Images of organ or tissue samples contain millions of cells. And while analyzing these cells in situ is an important part of biological research, such images make it nearly impossible to identify individual cells, determine their function and understand their organization. A technique called spatial transcriptomics brings these cells into focus by combining imaging with the ability to quantify the level of genes in each cell — giving researchers the ability to study in detail several key biological mechanisms, ranging from how immune cells fight cancer to the cellular impact of drugs and aging.

Many current spatial transcriptomics platforms still lack the resolution required for closer, more detailed analysis. These technologies often group cells in clusters that range from several to 50 cells for each measurement, a resolution that may be sufficient for well-represented large cells but that is problematic for small cells or ones that aren’t well represented. These rare cells may be the most critical for the disease or condition being studied.

In a new paper published in Nature MethodsComputational Biology Department researchers Hao Chen, Dongshunyi Li and Ziv Bar-Joseph unveiled a method that uses artificial intelligence to augment the latest spatial transcriptomics technologies.

The CMU research focuses on more recent technologies that produce images at a much closer scale, allowing for subcellular resolution (or multiple measurements per cell). While these techniques solve the resolution issue, they present new challenges because the resulting images are so close-up that rather than capturing 15 to 50 cells per image, they capture only a few genes. This reversal of the previous problem creates difficulties in identifying the individual components and determining how to group these measurements to learn about specific cells. It also obscures the big picture.

The algorithm developed by the CBD researchers, called subcellular spatial transcriptomics cell segmentation (SCS), harnesses AI and advanced deep neural networks to adaptively identify cells and their constituent parts. SCS uses transformer models, similar to those used by large language models like ChatGPT, to gather information from the area surrounding each measurement. Just as ChatGPT uses the entire context of a sentence or paragraph for word completion, the SCS method fills in missing information for a specific measurement by incorporating information from the cells around it.

When applied to images of brain and liver samples with hundreds of thousands of cells, SCS accurately identified the exact location and type of each cell. SCS also identified several cells missed by current analysis approaches, such as rare and small cells that may play a crucial role in specific diseases or processes, including aging. SCS also provided information on location of molecules within cells, greatly improving the resolution at which researchers can study cellular organization.

“The ability to use the most recent advances in AI to aid the study of the human body opens the door to several downstream applications of spatial transcriptomics to improve human health,” said Ziv Bar-Joseph, the FORE Systems Professor of Machine Learning and Computational Biology at CMU. Such downstream applications are already being investigated by several large consortiums, including the Human BioMolecular Atlas Program (HuBMAP), that are using spatial transcriptomics to create a detailed, 3D map of the human body.

“By integrating state-of the-art biotechnology and AI, SCS helps unlock several open questions about cellular organization that are key to our ability to understand, and ultimately treat, disease,” added Hao Chen, a Lane Postdoctoral Fellow in CBD.

SCS is available free on GitHub and was supported by grants from the National Institutes of Health and the National Science Foundation. The paper, “SCS: Cell Segmentation for High-Resolution Spatial Transcriptomics,” is available on Nature Methods.

Journal: Nature Methods
DOI: 10.1038/s41592-023-01939-3
Article Title: SCS: cell segmentation for high-resolution spatial transcriptomics
Article Publication Date: 10-Jul-2023

Media Contact

Aaron Aupperlee
Carnegie Mellon University
aaupperlee@cmu.edu
Office: 412-268-9068

Media Contact

Aaron Aupperlee
Carnegie Mellon University

All latest news from the category: Information Technology

Here you can find a summary of innovations in the fields of information and data processing and up-to-date developments on IT equipment and hardware.

This area covers topics such as IT services, IT architectures, IT management and telecommunications.

Back to home

Comments (0)

Write a comment

Newest articles

The Microprocessor inside you

It’s a big year for microRNAs. The 2024 Nobel Prize in Physiology or Medicine went to Victor Ambros and Gary Ruvkun, who discovered the first microRNA in 1993. Today, we…

To build better fiber optic cables, ask a clam

Heart cockles have shells with built-in skylights to let in light for symbiotic algae. Since the first fiber optic cables rolled out in the 1970s, they’ve become a major part…

Lifesaver under the skin

Defibrillator with sternum electrode implanted for the first time at Göttingen University Medical Center to prevent sudden cardiac death. At the Heart Center of the University Medical Center Göttingen (UMG),…