"We've made huge progress in recent years with many genomes, including humans, but a lot of the problems can't be solved by simply dumping data into a computer and having truth and light come out the other end," said Indiana University Bloomington biologist Thomas Kaufman, who co-led the project. "One of the things we've learned from this project is that when you compare a lot of different but related genomes, you are more likely to see the genes that are buried in all that A-C-T-G mush."
Two papers in this week's Nature separately report the results of the four-year genome project and use the data to draw conclusions about the fruit fly genus Drosophila, particularly its star species, the human nuisance Drosophila melanogaster. Among the papers' conclusions is that any individual species' genome can be resolved far more accurately when related genomes are compared with it. The project was primarily funded by the National Human Genome Research Institute (NHGRI), part of the National Institutes of Health.
More than 40 "companion" manuscripts are being published or are in press, each of which examines a different aspect of the data produced by the Drosophila 12 Genomes Consortium.
"This remarkable scientific achievement underscores the value of sequencing and comparing many closely-related species, especially those with great potential to enhance our understanding of fundamental biological processes," said Francis S. Collins, director of NHGRI. "Thanks to the consortium's hard work, scientists around the world now have a rich new source of genomic data that can be mined in many different ways and applied to other important model systems as well as humans."
The consortium purposely chose a wide variety of fruit flies for study, guessing correctly that a diverse set would make both the gene similarities and the differences among the 12 species easier to identify. Some of the Drosophila species the scientists studied are closely related to D. melanogaster; others are not. Some of the flies fill very specialized ecological niches, such as D. sechellia, which has evolved a unique ability to detoxify the fruit of the Seychelles' noni tree. The other 10 species the consortium examined were D. pseudoobscura, D. simulans, D. yakuba, D. erecta, D. ananassae, D. persimilis, D. willistoni, D. virilis, D. grimshawi, and the cactus-loving D. mojavensis. D. melanogaster's genome was published in 2000 and D. pseudoobscura's in 2005. The other genomes are newly published.
In comparing the 12 genomes, the scientists found 1,193 new protein-coding genes and hundreds of new functional elements, including regulatory sequences that determine how quickly genes are expressed, and genes that encode functional RNAs such as small nuclear RNAs. They also learned that certain genes appear to be evolving faster than others, such as the genes associated with smell and taste, sex and reproduction, and defenses against pathogens.
The Drosophila 12 Genomes Consortium found that D. melanogaster shares about 77 percent of its genes with the other 11 species they studied. The scientists also found errors in about 3 percent of previously sequenced D. melanogaster protein-coding genes, correcting 414 gene sequences on record.
A vexing problem for genomicists is finding genes and other important DNA sequences in heterochromatin, tightly packed areas of chromosomes presumed to experience little expression. Heterochromatin is common in animal genomes.
"The heterochromatin is very hard to analyze," Kaufman said. "Studies show heterochromatin changes the most. It's full of intermediate- and full-repeat sequences. And there are genes buried in this stuff."
The conventions for locating the genes that encode proteins are well established. The lingering problem for genomicists is locating genes whose parts are interrupted repeatedly, as well as locating genes that do not code for proteins.
When a large number of genomes are compared, these sorts of genes become relatively easy to locate. Genes that do important things for cells or tissues are more likely to be "conserved" over time; that is, they don't change much despite millions of years of mutations.
One of the companion pieces accompanying this week's Nature papers was written by Indiana University Bloomington computational biologist Matthew Hahn. Hahn reports in PLoS Genetics that although all 12 Drosophila species have about the same number of genes (roughly 14,000), the genomes are more dynamic than one might expect.
"The highest turnover in gene number occurs in genes involved in sex and reproduction," Hahn said. "Our results demonstrate that the apparent stasis in total gene number among species has masked rapid turnover in individual gene gain and loss. It is likely that this evolutionary revolving door has played a large role in shaping the morphological, physiological, and metabolic differences among species. This is the reason the 12 species only share 77 percent of their genes."
David Bricker | EurekAlert!