Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

Stanford/Packard scientist’s data-mining technique strikes genetic gold

12.01.2006


A new method to mine existing scientific data may provide a wealth of information about the interactions among genes, the environment and biological processes, say researchers at the Stanford University School of Medicine, Lucile Packard Children’s Hospital and Harvard Medical School. Like panning for gold, they used the powerful technique to sift through millions of bits of unrelated information - in this case, gene expression data from so-called microarray experiments - to pinpoint genes likely to be involved in leukemia, aging, injury and muscle development.



"This is just the tip of the iceberg," said bioinformatics specialist Atul Butte, MD, PhD, who is also a pediatrician at Lucile Packard Children’s Hospital at Stanford. "Nearly 100 different diseases have been studied using microarrays, spanning all of medicine. This is a new way to explore this type of data. We can study virtually everything that’s been studied." Butte is the first author of the study, which is published in the Jan. 6 online issue of Nature Biotechnology.

The advance comes with a caveat, however: clinically useful nuggets will be buried under the avalanche of data inundating international repositories each year unless scientists come up with a way to better classify their experiments and results.


"Libraries figured out a long time ago how to classify items using the Dewey decimal and other systems," said Butte, who estimates that the contents of the databases are more than doubling each year. "We need to write software now that will help scientists assign the proper concepts to each experiment."

Microarray experiments allow researchers to compare the expression patterns of tens of thousands of individual genes over time in diseased and healthy cells, or in many other experimental conditions. Each experiment generates thousands of pieces of data about the cell’s genes. Although biologists use the technology routinely, focusing only on the few results pertinent to their particular research topic, most scientific journals require that their authors submit all of their data to international databases for use by other researchers.

Butte and his Harvard co-author, Isaac Kohane, MD, PhD, used computer programs to automatically categorize the tens of thousands of microarray experiments in a single database based on the terms, or concepts, used by the submitter to describe the experiment. They then looked for findings shared by several experiments with similar concepts, such as tissue type, for example. Comparing results from many similar experiments allowed them to identify correlations that may not be statistically significant in just one experiment.

Butte and Kohane identified several previously unknown correlations: nine genes whose expression increased or decreased significantly with aging, two genes that are highly expressed in response to injury, and another gene in which the expression drops significantly in leukemic cells. They also confirmed these relationships by studying genes known to be associated with muscle tissue in both humans and mice.

Their classification system was stymied, however, when scientists included too much or too little information in the text annotations, or used imprecise words such as "pool," which can mean either a body of water or the action of combining the contents of two or more tubes.

"As a community, we’ve standardized the way the data itself is represented," said Butte, "but there are no formal requirements for the accompanying textual descriptions of this data. Sometimes people seem to almost copy and paste their entire scientific paper into the text box. We need to clean up our annotations because now we’re showing that they have value."

Butte and Kohane favor using the existing Unified Medical Language System, which consists of more than 1 million biomedical concepts, to vastly simplify the computerized sorting of the thousands of microarray experiments submitted to databases each year. Without such a system, valuable information will simply be lost as the results pile up. The National Institutes of Health recently funded the National Center for Biomedical Ontology, a consortium led by Stanford professor Mark Musen, MD, PhD, to develop ontologies to allow scientists to describe their data in standardized ways.

"All the answers are already there," said Butte. "We’ve reached a critical mass with this data. But unless we’re careful, we’re going to end up with a big mess."

Krista Conger | EurekAlert!
Further information:
http://www.stanford.edu
http://mednews.stanford.edu
http://www.lpch.org

More articles from Life Sciences:

nachricht How brains surrender to sleep
23.06.2017 | IMP - Forschungsinstitut für Molekulare Pathologie GmbH

nachricht A new technique isolates neuronal activity during memory consolidation
22.06.2017 | Spanish National Research Council (CSIC)

All articles from Life Sciences >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Can we see monkeys from space? Emerging technologies to map biodiversity

An international team of scientists has proposed a new multi-disciplinary approach in which an array of new technologies will allow us to map biodiversity and the risks that wildlife is facing at the scale of whole landscapes. The findings are published in Nature Ecology and Evolution. This international research is led by the Kunming Institute of Zoology from China, University of East Anglia, University of Leicester and the Leibniz Institute for Zoo and Wildlife Research.

Using a combination of satellite and ground data, the team proposes that it is now possible to map biodiversity with an accuracy that has not been previously...

Im Focus: Climate satellite: Tracking methane with robust laser technology

Heatwaves in the Arctic, longer periods of vegetation in Europe, severe floods in West Africa – starting in 2021, scientists want to explore the emissions of the greenhouse gas methane with the German-French satellite MERLIN. This is made possible by a new robust laser system of the Fraunhofer Institute for Laser Technology ILT in Aachen, which achieves unprecedented measurement accuracy.

Methane is primarily the result of the decomposition of organic matter. The gas has a 25 times greater warming potential than carbon dioxide, but is not as...

Im Focus: How protons move through a fuel cell

Hydrogen is regarded as the energy source of the future: It is produced with solar power and can be used to generate heat and electricity in fuel cells. Empa researchers have now succeeded in decoding the movement of hydrogen ions in crystals – a key step towards more efficient energy conversion in the hydrogen industry of tomorrow.

As charge carriers, electrons and ions play the leading role in electrochemical energy storage devices and converters such as batteries and fuel cells. Proton...

Im Focus: A unique data centre for cosmological simulations

Scientists from the Excellence Cluster Universe at the Ludwig-Maximilians-Universität Munich have establised "Cosmowebportal", a unique data centre for cosmological simulations located at the Leibniz Supercomputing Centre (LRZ) of the Bavarian Academy of Sciences. The complete results of a series of large hydrodynamical cosmological simulations are available, with data volumes typically exceeding several hundred terabytes. Scientists worldwide can interactively explore these complex simulations via a web interface and directly access the results.

With current telescopes, scientists can observe our Universe’s galaxies and galaxy clusters and their distribution along an invisible cosmic web. From the...

Im Focus: Scientists develop molecular thermometer for contactless measurement using infrared light

Temperature measurements possible even on the smallest scale / Molecular ruby for use in material sciences, biology, and medicine

Chemists at Johannes Gutenberg University Mainz (JGU) in cooperation with researchers of the German Federal Institute for Materials Research and Testing (BAM)...

All Focus news of the innovation-report >>>

Anzeige

Anzeige

Event News

Plants are networkers

19.06.2017 | Event News

Digital Survival Training for Executives

13.06.2017 | Event News

Global Learning Council Summit 2017

13.06.2017 | Event News

 
Latest News

Quantum thermometer or optical refrigerator?

23.06.2017 | Physics and Astronomy

A 100-year-old physics problem has been solved at EPFL

23.06.2017 | Physics and Astronomy

Equipping form with function

23.06.2017 | Information Technology

VideoLinks
B2B-VideoLinks
More VideoLinks >>>