Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:

 

Computer scientists develop tool for mining genomic data

16.02.2004


Equipped with cutting-edge techniques to track the activity of tens of thousands of genes in a single experiment, biologists now face a new challenge - determining how to analyze this tidal wave of data. Stanford Associate Professor of Computer Science Daphne Koller and her colleagues have come to the rescue with a strategic approach that reduces the trial-and-error aspect of genetic sequence analysis.



’’What we’re developing is a suite of computational tools that take reams of data and automatically extract a picture of what’s happening in the cell,’’ says Koller. ’’It tells you where to look for good biology.’’

Koller presented her statistical approach for mining genomic data at a Feb. 14 symposium - ’’Machine Learning in the Sciences’’ - at the annual meeting of the American Association for the Advancement of Science (AAAS) in Seattle.


Several years ago, before Koller came onto the scene, a new generation of high-throughput assays revolutionized molecular biology. In the most stunning example of this technology, scientists began using thumbnail-sized ’’gene chips’’ to monitor the activities of thousands of genes at once. In October 2003, Santa Clara-based Affymetrix took this breakthrough to a new level when it began marketing whole-genome chips packed with all 30,000 to 50,000 known human genes. Genome chips can reveal, for instance, that in kidney cells treated with a certain drug, 116 genes spring into action while another 255 get shut off.

But this state-of-the-art DNA microarray technology provides only a single snapshot of the cell. ’’It’s a very partial view,’’ Koller says.

What scientists really want to know is how groups of genes work together to control specific biological processes, such as muscle development or cancer progression. Unraveling these regulatory networks - for example, determining that Gene A gets activated by Gene B but repressed by Gene C - is a daunting task.

Sifting through whopping amounts of DNA microarray data to cull the hundreds of activator and repressor candidates is actually the easy part. The real challenge is figuring out which of these genes, if any, are biologically meaningful. This requires a bewildering array of hit-or-miss wet-lab experiments that examine protein-protein and protein-DNA interactions among the candidate genes.

Koller’s computational tools will make this scheme less formidable by providing scientists with targeted hypotheses in the form of ’’Gene A regulates Gene B under Condition C.’’ These predictions are generated from a probabilistic framework that integrates data from a variety of sources, including microarrays, DNA sequences, and protein-protein and protein-DNA interactions.

As Koller sees it, each of these sources offers a glimpse into what is happening in the cell: ’’a snapshot from this angle, a shot from another angle, data from a third, and so on.’’ Her computational scheme creates ’’the best picture we can construct from putting all of these snapshots together.’’

The proof of concept for Koller’s targeted hypotheses came in a June 2003 Nature Genetics publication, which described the application of her tools to predict gene regulatory networks in a variety of biological processes in yeast. Three of these predictions were confirmed in wet-lab experiments, suggesting regulatory roles for previously uncharacterized proteins.

’’The creativity and computer science perspective brought to these problems by Koller and her collaborators provide a tremendous boost to biology,’’ says Matthew Scott, a developmental biologist at Stanford and chair of the scientific leadership council of Bio-X, an interdisciplinary initiative. His research group has used Koller’s approach to identify genes involved in specific processes during embryonic development, to determine which genes are key regulators of other genes and to track changes in gene activities during disease progression.

Scott adds that while the computational methods suggest interesting hypotheses, their ultimate validation relies upon lab experiments.

In the future, Koller hopes to develop her scheme to handle multi-species analysis - for instance, to identify gene regulatory networks that appear in both human and mouse genomes. ’’When a regulatory module is conserved across multiple species, that indicates it’s playing a significant role,’’ Koller says.

Koller’s collaborators include Eran Segal and Michael Shapira (both of Stanford), Nir Friedman (Hebrew University of Jerusalem), Aviv Regev (Harvard Center for Genome Research), Dana Pe’er (Harvard-Lipper Center for Computational Genetics), Roman Yelensky (Massachusetts Institute of Technology) and David Botstein (Princeton University).

Esther Landhuis | EurekAlert!
Further information:
http://robotics.stanford.edu/~koller/index.html
http://dags.stanford.edu
http://www.stanford.edu/news/

More articles from Information Technology:

nachricht Three components on one chip
06.12.2018 | Universität Stuttgart

nachricht New quantum materials could take computing devices beyond the semiconductor era
04.12.2018 | University of California - Berkeley

All articles from Information Technology >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: Researchers develop method to transfer entire 2D circuits to any smooth surface

What if a sensor sensing a thing could be part of the thing itself? Rice University engineers believe they have a two-dimensional solution to do just that.

Rice engineers led by materials scientists Pulickel Ajayan and Jun Lou have developed a method to make atom-flat sensors that seamlessly integrate with devices...

Im Focus: Three components on one chip

Scientists at the University of Stuttgart and the Karlsruhe Institute of Technology (KIT) succeed in important further development on the way to quantum Computers.

Quantum computers one day should be able to solve certain computing problems much faster than a classical computer. One of the most promising approaches is...

Im Focus: Substitute for rare earth metal oxides

New Project SNAPSTER: Novel luminescent materials by encapsulating phosphorescent metal clusters with organic liquid crystals

Nowadays energy conversion in lighting and optoelectronic devices requires the use of rare earth oxides.

Im Focus: A bit of a stretch... material that thickens as it's pulled

Scientists have discovered the first synthetic material that becomes thicker - at the molecular level - as it is stretched.

Researchers led by Dr Devesh Mistry from the University of Leeds discovered a new non-porous material that has unique and inherent "auxetic" stretching...

Im Focus: The force of the vacuum

Scientists from the Theory Department of the Max Planck Institute for the Structure and Dynamics of Matter (MPSD) at the Center for Free-Electron Laser Science (CFEL) in Hamburg have shown through theoretical calculations and computer simulations that the force between electrons and lattice distortions in an atomically thin two-dimensional superconductor can be controlled with virtual photons. This could aid the development of new superconductors for energy-saving devices and many other technical applications.

The vacuum is not empty. It may sound like magic to laypeople but it has occupied physicists since the birth of quantum mechanics.

All Focus news of the innovation-report >>>

Anzeige

Anzeige

VideoLinks
Industry & Economy
Event News

EGU 2019 meeting: Media registration now open

06.12.2018 | Event News

Expert Panel on the Future of HPC in Engineering

03.12.2018 | Event News

Inaugural "Virtual World Tour" scheduled for december

28.11.2018 | Event News

 
Latest News

A new molecular player involved in T cell activation

07.12.2018 | Life Sciences

High-temperature electronics? That's hot

07.12.2018 | Materials Sciences

Supercomputers without waste heat

07.12.2018 | Physics and Astronomy

VideoLinks
Science & Research
Overview of more VideoLinks >>>