The lead report from the ENCyclopedia Of DNA Elements (ENCODE) Consortium, published in Nature, together with 28 companion papers published in Genome Research, defined in detail which regions of the genome are actively copied in the cell, revealed the location and studied evolution of elements that control gene activity, and defined the relationship between DNA-associated proteins and gene activity and DNA replication.
The complex tapestry of interwoven elements revealed today suggests that "our perspective of transcription and genes may have to evolve," the researchers state, noting that their research "poses some interesting mechanistic questions" that have yet to be answered.
Our understanding of genome biology from the Human Genome Project gave us an overview of a 3-billion-base genome, peppered with some 22,000 discrete genes and the sequences that regulate their activity. These were estimated to occupy perhaps 3-5% of the genome, though this number is expected to be an underestimate.
"The new view transforms our view of the genomic fabric," explained Dr Tim Hubbard, from the Wellcome Trust Sanger Institute, "The majority of the genome is copied, or transcribed, into RNA, which is the active molecule in our cells, relaying information from the archival DNA copy to the cellular machinery. This is a remarkable finding, since most prior research suggested only a fraction of the genome was transcribed."
"But it is our new understanding of regulation of genes that stands out. The integrated approach has helped us to identify new regions of gene regulation and altered our view of how gene regulation occurs."
From the earliest studies of gene activity in bacteria, a picture emerged that suggested control regions were most often located at or near sites from which gene transcription started. The new work identifies many previously unknown control regions and shows that control regions are as likely to be beyond the end of the gene.
Alterations in control regions are increasingly thought to be of significance for human disease, Dr Dermitzakis from the Wellcome Trust Sanger Institute and one of the corresponding authors on the paper explained: "For the first time we can see DNA sequence variation in the context of the biochemical workings of the cell. We can now begin to unravel the consequences that such variations hold for individuals and their susceptibility to disease."
The team showed that transcription of DNA is pervasive across the genome, and that RNA transcripts overlap known genes and are found in what were previously thought to be gene ‘deserts'.
"A major surprise was that many of the novel control regions are not shared with other species, but restricted to our human genome," continued Dr Dermitzakis. "We appear to have a reservoir of active elements that seem to provide no specific or direct benefit.
"Our suggestion is that these elements can provide a source for new variation between species and within the human genome. This is our genomic seedcorn for the future. "
The scale of the collaboration brings new understanding of the interaction between our genome and the proteins that control gene activity and DNA replication. The results show that proteins called histones that bind DNA to package it within the cell nucleus are modified to promote or inhibit gene activity and can be used to predict better the location of novel genes.
"Specific types of modifications of the histone proteins near gene starts are a strong predictor of gene activity," explained Dr Ian Dunham, from the Wellcome Trust Sanger Institute, "whereas further histone modifications at sites away from genes appear to be a signature of regulatory elements that can enhance transcription." A detailed analysis of these effects is also published by the Sanger Institute group in one of the companion papers in Genome Research.
"It is only from a study such as ENCODE that we can obtain such a valuable detailed view of our genome. This project has been a magnificent collaboration amongst some of the world's premier genome scientists, and has revealed many new insights. There is every expectation that a great deal more will be revealed as the project scales to the whole genome."
Although much that is new has been discovered, much yet remains to be understood. Similarity of DNA sequence between species is often a sign of the value of that sequence, yet a function has not been found for many DNA sequences that are conserved. The role of the massive new numbers of RNA transcripts is unknown. And the function of the large number of control elements is yet to be elucidated.
The ENCODE consortium is organized by the National Human Genome Research Institute (NHGRI), whose Director, Francis S Collins, MD PhD, said: "This impressive effort has uncovered many exciting surprises and blazed the way for future efforts to explore the functional landscape of the entire human genome.
"Because of the hard work and keen insights of the ENCODE consortium, the scientific community will need to rethink some long-held views about what genes are and what they do, as well as how the genome's functional elements have evolved. This could have significant implications for efforts to identify the DNA sequences involved in many human diseases."
Don Powell | alfa
Cryo-electron microscopy achieves unprecedented resolution using new computational methods
24.03.2017 | DOE/Lawrence Berkeley National Laboratory
How cheetahs stay fit and healthy
24.03.2017 | Forschungsverbund Berlin e.V.
Astronomers from Bonn and Tautenburg in Thuringia (Germany) used the 100-m radio telescope at Effelsberg to observe several galaxy clusters. At the edges of these large accumulations of dark matter, stellar systems (galaxies), hot gas, and charged particles, they found magnetic fields that are exceptionally ordered over distances of many million light years. This makes them the most extended magnetic fields in the universe known so far.
The results will be published on March 22 in the journal „Astronomy & Astrophysics“.
Galaxy clusters are the largest gravitationally bound structures in the universe. With a typical extent of about 10 million light years, i.e. 100 times the...
Researchers at the Goethe University Frankfurt, together with partners from the University of Tübingen in Germany and Queen Mary University as well as Francis Crick Institute from London (UK) have developed a novel technology to decipher the secret ubiquitin code.
Ubiquitin is a small protein that can be linked to other cellular proteins, thereby controlling and modulating their functions. The attachment occurs in many...
In the eternal search for next generation high-efficiency solar cells and LEDs, scientists at Los Alamos National Laboratory and their partners are creating...
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are less stable. Now researchers at the Technical University of Munich (TUM) have, for the first time ever, produced a composite material combining silicon nanosheets and a polymer that is both UV-resistant and easy to process. This brings the scientists a significant step closer to industrial applications like flexible displays and photosensors.
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are...
Enzymes behave differently in a test tube compared with the molecular scrum of a living cell. Chemists from the University of Basel have now been able to simulate these confined natural conditions in artificial vesicles for the first time. As reported in the academic journal Small, the results are offering better insight into the development of nanoreactors and artificial organelles.
Enzymes behave differently in a test tube compared with the molecular scrum of a living cell. Chemists from the University of Basel have now been able to...
20.03.2017 | Event News
14.03.2017 | Event News
07.03.2017 | Event News
24.03.2017 | Materials Sciences
24.03.2017 | Physics and Astronomy
24.03.2017 | Physics and Astronomy