Gene finding remains an important problem in biology because scientists are still far from fully mapping the set of human genes. Furthermore, gene maps for other vertebrates, including important model organisms such as mouse, are far less complete than the human annotation. The new technique, known as CONTRAST (CONditionally TRAined Search for Transcripts), works by comparing a genome of interest to the genomes of several related species.
CONTRAST exploits the fact that protein-coding genes play a specific functional role within a cell and are therefore subject to characteristic evolutionary pressures. For example, mutations that alter an important part of a protein's structure are likely to be deleterious and thus selected against. On the other hand, mutations that preserve a protein's amino acid sequence are normally well tolerated. Thus, protein-coding genes can be identified by searching a genome for regions that show evidence of such patterns of selection. However, learning to recognize such patterns when more than two species are compared has proved difficult.
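To make the selection pattern concrete, the following is a minimal Python sketch that counts amino-acid-preserving (synonymous) and amino-acid-changing (nonsynonymous) codon differences between two aligned, in-frame coding sequences. It is an illustration only and is not how CONTRAST works internally; the function name `count_codon_changes` and the example sequences are hypothetical, and the sketch assumes the standard genetic code and an alignment that is already in reading frame.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def count_codon_changes(target, informant):
    """Return (synonymous, nonsynonymous) codon differences.

    Hypothetical helper: both sequences must be aligned, in frame, and of
    equal length. Codons containing gaps or ambiguous bases are skipped.
    """
    if len(target) != len(informant):
        raise ValueError("aligned sequences must have equal length")
    synonymous = nonsynonymous = 0
    for i in range(0, len(target) - len(target) % 3, 3):
        a = target[i:i + 3].upper()
        b = informant[i:i + 3].upper()
        if a not in CODON_TABLE or b not in CODON_TABLE or a == b:
            continue
        if CODON_TABLE[a] == CODON_TABLE[b]:
            synonymous += 1      # protein sequence preserved
        else:
            nonsynonymous += 1   # protein sequence altered
    return synonymous, nonsynonymous


# GCT -> GCC keeps alanine; AAA -> AGA changes lysine to arginine.
print(count_codon_changes("ATGGCTAAA", "ATGGCCAGA"))  # (1, 1)
```

In a region under protein-coding selection, one would expect synonymous differences to outnumber nonsynonymous ones; a count like this is a crude, hand-built stand-in for the kind of signal a gene finder looks for in an alignment.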
Previous systems for gene prediction could make effective use of only one additional 'informant' genome. For example, when searching for human genes, taking into account information from the mouse genome led to a substantial increase in accuracy. However, no system was able to leverage additional informant genomes to improve upon state-of-the-art performance using mouse alone, although it was expected that adding informants would make patterns of selection clearer. CONTRAST solves this problem by learning to recognize the signature of selection on protein-coding genes in a fundamentally different way from previous approaches. Instead of constructing a model of sequence evolution, CONTRAST directly 'learns' which features of a genomic alignment are most useful for recognizing genes. This approach leads to higher overall accuracy and is able to extract useful information from several informant sequences.
In a test on the human genome, CONTRAST exactly predicted the full structure of 59% of the genes in the test set, compared with the previous best result of 36%. Its exact exon sensitivity of 93%, compared with a previous best of 84%, translates into many thousands of exons correctly predicted by CONTRAST but missed by previous methods. Importantly, CONTRAST's accuracy using a combination of eleven informant genomes was significantly higher than its accuracy using any single informant. The substantial advance in predictive accuracy represented by CONTRAST will further efforts to complete protein-coding gene maps for human and other organisms.
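As a rough guide to what a figure such as "exact exon sensitivity" measures, the sketch below assumes the common definition: the fraction of annotated exons for which a prediction matches both boundaries exactly. The article does not spell out the definition used in the evaluation, so this is an assumption, and the function name and coordinates are hypothetical.

```python
def exact_exon_sensitivity(annotated, predicted):
    """Fraction of annotated exons predicted with both boundaries exact.

    Hypothetical helper: exons are (start, end) coordinate pairs on the
    same sequence and strand.
    """
    annotated = set(annotated)
    if not annotated:
        raise ValueError("no annotated exons to compare against")
    return len(annotated & set(predicted)) / len(annotated)


annotated = [(100, 200), (300, 400), (500, 650)]
predicted = [(100, 200), (300, 410), (500, 650), (700, 800)]

# The second exon is missed because its end is off by 10 bases.
print(round(exact_exon_sensitivity(annotated, predicted), 2))  # 0.67
```

Under this definition a near miss counts as a miss, which is why a gain of several percentage points corresponds to many exons whose boundaries are now placed exactly.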
Further information about existing gene-prediction methods and the advance CONTRAST brings to the field can be found in a minireview by Paul Flicek, which accompanies the article by Batzoglou and colleagues.