Non-coding DNA sequences found in all plants may have undiscovered roles in basic plant development and response to the environment
DNA is the molecule that encodes the genetic instructions enabling a cell to produce the thousands of proteins it typically needs. The linear sequence of the A, T, C, and G bases in what is called coding DNA determines the particular protein that a short segment of DNA, known as a gene, will encode.
But in many organisms, there is much more DNA in a cell than is needed to code for all the necessary proteins. This non-coding DNA was often referred to as "junk" DNA because it seemed unnecessary. But in retrospect, we did not yet understand the function of these seemingly unnecessary DNA sequences.
We now know that non-coding DNA can have important functions other than encoding proteins. Many non-coding sequences produce RNA molecules that regulate gene expression by turning them on and off. Others contain enhancer or inhibitory elements.
Recent work by the international ENCODE (Encyclopedia of DNA Elements) Project (1, 2) suggested that a large percentage of non-coding DNA, which makes up an estimated 95% of the human genome, has a function in gene regulation. Thus, it is premature to say that "junk" DNA does not have a function—we just need to find out what it is!
To help understand the importance of this large amount of non-coding DNA in plants, Diane Burgess and Michael Freeling at the University of California, Berkeley have identified numerous conserved non-coding sequences (CNSs) of DNA that are found in a wide variety of plant species, including rice, banana, and cacao.
DNA sequences that are highly conserved, meaning that they are identical or nearly so in a variety of organisms, are likely to have important functions in basic biological processes. For example, the gene encoding ribosomal RNA, an essential part of the protein-synthesizing machinery needed by cells of all organisms, is highly conserved. Changes in the sequence of this key molecule are poorly tolerated, so ribosomal RNA sequences have changed relatively little over millions of years of evolution.
To identify the most highly conserved plant CNSs, Burgess and Freeling compared the genome (one copy of all the DNA in an organism) of the model plant Arabidopsis, a member of the mustard family, with the genome of columbine, a distantly related plant of the buttercup family.
The phylogenetic tree (see figure) shows the evolutionary relationships among the dicot (yellow) and monocot (blue) species they studied. Branch points represent points of divergence of two species from a common ancestor. Sequences in common between these two plants, which diverged over 130 million years ago, are likely to have important functions or they would have been lost due to random mutations or insertions or deletions.
They found over 200 CNSs in common between these distantly related species. In addition, 59 of these CNSs were also found in monocots, which are even more distant evolutionarily, and these were termed deep CNSs. Finally, they showed that 51 of these appear to be found in all flowering plants, based on their occurrence in Amborella, a flowering plant that diverged from all of the above plants even before the monocot-dicot split (see figure).
So what could be the function of these deep CNSs? We can get clues by analyzing the types of genes with which these CNSs are associated. The researchers found that nearly all of the deep CNSs are associated with genes involved in basic and universal biological processes in flowering plants—processes such as development, response to hormones, and regulation of gene expression.
They found that the majority of these CNSs are associated with genes involved in tissue and organ development, post-embryonic differentiation, flowering, and production of reproductive structures. Others are associated with hormone- and salt-responsive genes or with genes encoding transcription factors, which are regulatory proteins that control gene expression by turning other genes on and off.
In addition, they showed that these CNSs are enriched for binding sites for transcription factors, and propose that the function of some of this non-coding DNA is to act as a scaffold for organization of the gene expression machinery. The binding sites they found are known sequences implicated in other plants as necessary for response to biotic and abiotic stress, light, and hormones.
Furthermore, they discovered that a number of the CNSs could produce RNAs that have extensive double-stranded regions. These double-stranded regions have been shown to be involved in RNA stability, degradation, and in regulation of gene expression. Twelve of the most 59 highly conserved CNSs are associated with genes whose protein products interact with RNA. Clearly, these DNA sequences are not merely "junk!"
Now that Burgess and Freeling have identified the most highly conserved non-coding DNA sequences in flowering plants, future scientists have a better idea of which regions of the genome to focus on for functional studies. Do the predicted transcription factor-binding sites actually bind known or novel transcription factors? Do CNSs organize or regulate the gene expression machinery? Do CNSs encode RNAs that regulate fundamental processes in plants? The answers to these and many related questions will be easier to answer now that we have this set of deep CNSs that are likely to play important roles in basic cellular processes in plants.
(1) National Human Genome Research Institute (see http://www.genome.gov/10005107)
(2) Genome Research, Vol. 17, June 2007, special issue on ENCODE.
Science Editor, The Plant Cell
American Society of Plant Biologists
This work was supported by The National Science Foundation (IOS1248106).
Full citation: Burgess, D., and Freeling, F. (2014). The most deeply conserved noncoding sequences in plants serve similar functions to those in vertebrates despite large differences in evolutionary rates. Plant Cell 10.1105/tpc.113.121905.
Tyrone Spady | EurekAlert!
Pathogenic bacteria hitchhiking to North and Baltic Seas?
22.07.2016 | Alfred-Wegener-Institut, Helmholtz-Zentrum für Polar- und Meeresforschung
Unconventional quasiparticles predicted in conventional crystals
22.07.2016 | Max-Planck-Institut für Chemische Physik fester Stoffe
Munich Physicists have developed a novel electron microscope that can visualize electromagnetic fields oscillating at frequencies of billions of cycles per second.
Temporally varying electromagnetic fields are the driving force behind the whole of electronics. Their polarities can change at mind-bogglingly fast rates, and...
Breakup of continents with two speed: Continents initially stretch very slowly along the future splitting zone, but then move apart very quickly before the onset of rupture. The final speed can be up to 20 times faster than in the first, slow extension phase.phases
Present-day continents were shaped hundreds of millions of years ago as the supercontinent Pangaea broke apart. Derived from Pangaea’s main fragments Gondwana...
Scaffolding and specialised workers help with the delivery – Heidelberg biochemists gain new insights into biogenesis
A type of scaffolding on which specialised workers ply their trade helps in the manufacturing process of the two subunits from which the ribosome – the protein...
Scientists at the Helmholtz Zentrum München have developed a new mass spectrometry imaging method which, for the first time, makes it possible to analyze hundreds of metabolites in fixed tissue samples. Their findings, published in the journal Nature Protocols, explain the new access to metabolic information, which will offer previously unexploited potential for tissue-based research and molecular diagnostics.
In biomedical research, working with tissue samples is indispensable because it permits insights into the biological reality of patients, for example, in...
Chemists at the University of Basel have succeeded in using computer simulations to elucidate transient structures in proteins. In the journal Angewandte Chemie, the researchers set out how computer simulations of details at the atomic level can be used to understand proteins’ modes of action.
Using computational chemistry, it is possible to characterize the motion of individual atoms of a molecule. Today, the latest simulation techniques allow...
15.07.2016 | Event News
15.07.2016 | Event News
11.07.2016 | Event News
22.07.2016 | Information Technology
22.07.2016 | Physics and Astronomy
22.07.2016 | Life Sciences