Scientists from the Helmholtz Zentrum München have developed a program that helps manage enormous datasets. The software, named Scanpy, is a candidate for analyzing the Human Cell Atlas and has recently been published in ‘Genome Biology’.
“It’s about analyzing gene-expression data* of a large number of individual cells,” explains lead author Alex Wolf of the Institute of Computational Biology (ICB) at Helmholtz Zentrum München. He developed Scanpy together with his colleague Philipp Angerer in the Machine Learning Group of Prof. Dr. Dr. Fabian Theis. In addition to his position at Helmholtz Zentrum, Theis is also a professor of mathematical modelling of biological systems at the Technical University of Munich.
“New technical advances generate several orders of magnitude more data with a correspondingly greater information content,” Theis says. “However, the historically evolved software infrastructure for gene-expression analysis simply wasn’t designed to cope with the new challenges. New analytic methods are therefore needed.”
The race for the Human Cell Atlas
According to Theis, a major international research project could also benefit from the software. A team of international scientists is compiling a reference database, called the Human Cell Atlas, which holds data on the gene activity of all human cell types. “For this project, and in a growing number of other projects in which databases are combined, it is important to have scalable software,” says Theis. It is therefore no surprise that Scanpy is currently a candidate for helping to analyze the Human Cell Atlas (https://www.humancellatlas.org/).
“The publication of Scanpy marks the first software that allows comprehensive analysis of large gene-expression datasets with a broad range of machine-learning and statistical methods,” explains Wolf, describing the achievement. “The software is already being used by a number of groups around the world, notably at the Broad Institute of Harvard University and the Massachusetts Institute of Technology, MIT.”
Technologically, the application breaks new ground: whereas biostatistics programs are traditionally written in the programming language R, Scanpy is based on Python, the dominant language in the machine-learning community. Another new feature is that graph-based algorithms lie at the heart of Scanpy.
Unlike the usual approach of regarding cells as points in a coordinate system within gene-expression space, the algorithms use a graph-like coordinate system. Instead of characterizing a single cell by the expression value for thousands of genes, the system simply characterizes cells by identifying their closest neighbors – very much like the connections in social networks. In fact, to identify cell types, Scanpy uses the same algorithms as Facebook does for identifying communities.
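The neighbor-based idea described above can be illustrated with a minimal sketch. It is not Scanpy's implementation: the six toy cells, their three "gene" values, and the helper names `knn_graph` and `connected_components` are all hypothetical, and finding connected components is used here only as a crude stand-in for the community-detection algorithms the article mentions.

```python
import math

def knn_graph(cells, k):
    """Link each cell to its k nearest cells; return {index: set of neighbor indices}."""
    n = len(cells)
    graph = {i: set() for i in range(n)}
    for i in range(n):
        # Rank all other cells by Euclidean distance in gene-expression space.
        others = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: math.dist(cells[i], cells[j]),
        )
        for j in others[:k]:
            # Store the edge in both directions so the graph is undirected.
            graph[i].add(j)
            graph[j].add(i)
    return graph

def connected_components(graph):
    """Give every cell the label of the connected group it belongs to."""
    labels = {}
    current = 0
    for start in graph:
        if start in labels:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            node = stack.pop()
            for neighbor in graph[node]:
                if neighbor not in labels:
                    labels[neighbor] = current
                    stack.append(neighbor)
        current += 1
    return [labels[i] for i in range(len(graph))]

# Hypothetical expression values: rows are cells, columns are genes.
cells = [
    [9.0, 0.5, 0.2],
    [8.5, 0.7, 0.1],
    [9.2, 0.4, 0.3],
    [0.3, 7.8, 6.9],
    [0.5, 8.1, 7.2],
    [0.2, 7.5, 7.0],
]

graph = knn_graph(cells, k=2)
labels = connected_components(graph)
print(labels)  # [0, 0, 0, 1, 1, 1]
```

Once the graph is built, the grouping step never looks at the expression values again; it works only with who is connected to whom, which is the point the comparison with social networks makes.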
* Expression describes how often a gene is read, i.e. it provides information on the gene’s activity.
Alex Wolf and his team recently took one of the top places in the Data Science Bowl, one of the world’s most highly endowed competitions in the field of big data. For its entry, the team programmed an algorithm that can detect lung cancer within a few milliseconds on the basis of 300 layers of a three-dimensional CAT scan – a process that can take a radiologist, in the worst case, up to several hours.
In addition, the team has recently published an article in ‘Nature Communications’ on the reconstruction of cellular development processes from individual images: Paint by numbers: Algorithm reconstructs processes from individual images. https://www.helmholtz-muenchen.de/presse-medien/pressemitteilungen/alle-pressemi...
Wolf, A. et al. (2018): Scanpy: large-scale single-cell gene expression data analysis. Genome Biology, DOI: 10.1186/s13059-017-1382-0
The Helmholtz Zentrum München, the German Research Center for Environmental Health, pursues the goal of developing personalized medical approaches for the prevention and therapy of major common diseases such as diabetes and lung diseases. To achieve this, it investigates the interaction of genetics, environmental factors and lifestyle. The Helmholtz Zentrum München is headquartered in Neuherberg in the north of Munich and has about 2,300 staff members. It is a member of the Helmholtz Association, a community of 18 scientific-technical and medical-biological research centers with a total of about 37,000 staff members. http://www.helmholtz-muenchen.de/en
The Institute of Computational Biology (ICB) develops and applies methods for the model-based description of biological systems, using a data-driven approach by integrating information on multiple scales ranging from single-cell time series to large-scale omics. Given the fast technological advances in molecular biology, the aim is to provide and collaboratively apply innovative tools with experimental groups in order to jointly advance the understanding and treatment of common human diseases. http://www.helmholtz-muenchen.de/icb
Contact for the media:
Department of Communication, Helmholtz Zentrum München - German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764 Neuherberg - Tel. +49 89 3187 2238 - Fax: +49 89 3187 3324 - E-mail: email@example.com
Dr. Dr. Alexander Wolf, Helmholtz Zentrum München - German Research Center for Environmental Health, Institute of Computational Biology, Ingolstädter Landstr. 1, 85764 Neuherberg - Tel. +49 89 3187 4217, E-mail: firstname.lastname@example.org
Sonja Opitz | Helmholtz Zentrum München - Deutsches Forschungszentrum für Gesundheit und Umwelt