Forum for Science, Industry and Business

New gene prediction method capitalizes on multiple genomes

20.12.2007
Researchers at Stanford University report a new approach to computationally predicting the locations and structures of protein-coding genes in a genome. The work appears in the online open access journal Genome Biology.

Gene finding remains an important problem in biology, as scientists are still far from fully mapping the set of human genes. Furthermore, gene maps for other vertebrates, including important model organisms such as mouse, are far less complete than the human annotation. The new technique, known as CONTRAST (CONditionally TRAined Search for Transcripts), works by comparing a genome of interest to the genomes of several related species.

CONTRAST exploits the fact that protein-coding genes play a specific functional role within a cell and are therefore subject to characteristic evolutionary pressures. For example, mutations that alter an important part of a protein's structure are likely to be deleterious and thus selected against. On the other hand, mutations that preserve a protein's amino acid sequence are normally well tolerated. Thus, protein-coding genes can be identified by searching a genome for regions that show evidence of such patterns of selection. However, learning to recognize such patterns when more than two species are compared has proved difficult.
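The selection pattern described above can be made concrete with a small sketch. It is not part of CONTRAST: the function name count_codon_changes and the toy sequences are assumptions made for illustration, and the sequences are assumed to be already aligned and in reading frame. The sketch counts codon differences between a target and one informant, separating changes that preserve the encoded amino acid (synonymous) from changes that alter it (nonsynonymous).

```python
# Illustrative sketch only; names and toy data are hypothetical, not from CONTRAST.

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def count_codon_changes(target, informant):
    """Return (synonymous, nonsynonymous) codon differences.

    Both sequences are assumed to be aligned, equal in length, and to
    start in reading frame. Codons containing gaps or ambiguous bases
    are skipped.
    """
    if len(target) != len(informant):
        raise ValueError("aligned sequences must have equal length")
    synonymous = 0
    nonsynonymous = 0
    for start in range(0, len(target) - 2, 3):
        t_codon = target[start:start + 3].upper()
        i_codon = informant[start:start + 3].upper()
        if t_codon not in CODON_TABLE or i_codon not in CODON_TABLE:
            continue  # gap or ambiguous base: no call for this codon
        if t_codon == i_codon:
            continue  # identical codons carry no substitution
        if CODON_TABLE[t_codon] == CODON_TABLE[i_codon]:
            synonymous += 1
        else:
            nonsynonymous += 1
    return synonymous, nonsynonymous


# Toy example: GCT->GCC and GGA->GGG keep the amino acid, AAA->AGA changes it.
print(count_codon_changes("ATGGCTAAAGGA", "ATGGCCAGAGGG"))  # (2, 1)
```

A region where synonymous changes clearly outnumber nonsynonymous ones is the kind of evidence a comparative gene finder can use, although a real system would weigh many such signals across several informants at once.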

Previous systems for gene prediction were able to make effective use of one additional 'informant' genome. For example, when searching for human genes, taking into account information from the mouse genome led to a substantial increase in accuracy. However, no system had been able to use additional informant genomes to improve on the state-of-the-art performance achieved with mouse alone, although it was expected that adding informants would make patterns of selection clearer. CONTRAST solves this problem by learning to recognize the signature of selection on protein-coding genes in a fundamentally different way from previous approaches. Instead of constructing a model of sequence evolution, CONTRAST directly 'learns' which features of a genomic alignment are most useful for recognizing genes. This approach leads to higher overall accuracy and is able to extract useful information from several informant sequences.
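The idea of learning feature weights directly, rather than modeling sequence evolution, can be illustrated with a deliberately simple stand-in. The sketch below uses plain logistic regression trained by stochastic gradient ascent; it is not a reconstruction of CONTRAST's training procedure, and the feature definition (one made-up score per informant for each alignment window) and all numbers are assumptions chosen only to show the principle.

```python
import math

# Illustrative stand-in only: plain logistic regression on invented features.
# Nothing here reproduces CONTRAST's actual model or data.


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(examples, labels, epochs=500, learning_rate=0.5):
    """Fit weights and bias by stochastic gradient ascent on the log-likelihood."""
    weights = [0.0] * len(examples[0])
    bias = 0.0
    for _ in range(epochs):
        for features, label in zip(examples, labels):
            score = bias + sum(w * x for w, x in zip(weights, features))
            error = label - sigmoid(score)
            weights = [w + learning_rate * error * x
                       for w, x in zip(weights, features)]
            bias += learning_rate * error
    return weights, bias


def predict(weights, bias, features):
    """Return the estimated probability that a window is coding."""
    return sigmoid(bias + sum(w * x for w, x in zip(weights, features)))


# Hypothetical training windows. Each feature is an invented per-informant
# score, e.g. the share of mismatches falling on the third codon position.
windows = [
    [0.80, 0.70], [0.90, 0.60], [0.70, 0.80],   # labeled coding
    [0.30, 0.40], [0.40, 0.30], [0.35, 0.30],   # labeled non-coding
]
labels = [1, 1, 1, 0, 0, 0]

weights, bias = train_logistic(windows, labels)
print(predict(weights, bias, [0.85, 0.75]))  # coding-like window
print(predict(weights, bias, [0.30, 0.35]))  # non-coding-like window
```

Because each informant contributes its own feature with its own learned weight, adding an informant only adds a column to the feature vector, which hints at why a directly trained approach can absorb several informant genomes.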


In a test on the human genome, CONTRAST exactly predicted the full structure of 59% of the genes in the test set, compared with the previous best result of 36%. Its exact exon sensitivity of 93%, compared with a previous best of 84%, translates into many thousands of exons correctly predicted by CONTRAST but missed by previous methods. Importantly, CONTRAST's accuracy using a combination of eleven informant genomes was significantly higher than its accuracy using any single informant. The substantial advance in predictive accuracy represented by CONTRAST will further efforts to complete protein-coding gene maps for human and other organisms.

Further information about existing gene-prediction methods and the advance CONTRAST brings to the field can be found in a minireview by Paul Flicek, which accompanies the article by Batzoglou and colleagues.

Charlotte Webber | alfa
Further information:
http://genomebiology.com/
http://www.biomedcentral
