Some Genetic Research is Best Done Close to the Evolutionary Home

“While one can compare distant vertebrates to humans and identify sequences that are highly evolutionarily conserved, such elements are few and far between,” said Len Pennacchio, a geneticist with Berkeley Lab’s Genomics Division and the head of JGI’s genome analysis program. “In contrast, by comparing species that are more closely related, such as other mammals, we can find much more DNA sequence alignment.”

Pennacchio and Shyam Prabhakar are the principal authors of a paper that appears in the June issue of the publication Genome Research, which presents the results of a comparative genomics study that quantified the advantages of staying close to the evolutionary home. Other co-authors of the paper were Francis Poulin, Malak Shoukry, Veena Afzal, Edward Rubin and Olivier Couronne.

When Mother Nature develops something that works, she tends to stick with it. Hence sequences of DNA that serve as protein-coding genes or enhancers that regulate the expression of those genes have been conserved through thousands of years of evolution. Gene hunters have capitalized on this tendency by comparing the DNA of different species to identify genes and determine their functions. For example, the genome of the Fugu fish contains essentially the same genes as the human genome but carries them in approximately 400 million bases as compared to the three billion bases that make up human DNA.

Cross-species DNA sequence comparisons have also been used to identify the enhancers that regulate genes – meaning they control whether a gene is switched on or off — but until now, the relative merits of comparing species as diverse as humans and fish were not known.

“To address this problem, we identified evolutionarily conserved non-coding regions in primate, mammalian and more distant species using a uniform approach that facilitates an unbiased assessment of the impact of evolutionary distance on predictive power,” said Pennacchio. “We benchmarked computational predictions against previously identified regulatory elements at diverse genomic loci, and also tested numerous extremely conserved sequences in humans and rodents for enhancer activity.”

The computational algorithm, which is used to provide a uniform evaluation of the benefits and limitations of DNA sequence comparisons between close versus distant species, was developed by Prabhakar. He dubbed this program “Gumby,” after a mathematical concept called the Gumbel distribution. Prabhakar’s Gumby program has now been incorporated into VISTA, the comprehensive suite of programs and databases for comparative analysis of genomic sequences that was developed and is maintained at Berkeley Lab.

Using the Gumby program, Prabhakar, Pennacchio and their colleagues were able to identify human regulatory DNA sequences with a sensitivity that ranged from 53 to 80 percent, and a true-positive rate that ran as high as 67 percent based on comparisons with primates and other eutherian (placental) mammals. By contrast, comparisons with more distant species, including marsupial, avian, amphibian and fish, failed to identify most of the empirically defined functional non-coding DNA sequences.

Said Prabhakar, “Our results highlight the practical utility of close sequence comparisons, and the loss of sensitivity entailed by more distant comparisons. The intuitive relationship we derived between ancient and recent non-coding sequence conservation from whole-genome comparative analysis explains most of the observations from empirical benchmarking.”

This research was supported by the National Heart, Lung, and Blood Institute, through its Program for Genome Applications.

Berkeley Lab is a U.S. Department of Energy National Laboratory located in Berkeley, CA. It conducts unclassified scientific research and is managed by the University of California. Visit our Website at http://www.lbl.gov/.

Media Contact

Lynn Yarris EurekAlert!

More Information:

http://www.lbl.gov

All latest news from the category: Life Sciences and Chemistry

Articles and reports from the Life Sciences and chemistry area deal with applied and basic research into modern biology, chemistry and human medicine.

Valuable information can be found on a range of life sciences fields including bacteriology, biochemistry, bionics, bioinformatics, biophysics, biotechnology, genetics, geobotany, human biology, marine biology, microbiology, molecular biology, cellular biology, zoology, bioinorganic chemistry, microchemistry and environmental chemistry.

Back to home

Comments (0)

Write a comment

Newest articles

A universal framework for spatial biology

SpatialData is a freely accessible tool to unify and integrate data from different omics technologies accounting for spatial information, which can provide holistic insights into health and disease. Biological processes…

How complex biological processes arise

A $20 million grant from the U.S. National Science Foundation (NSF) will support the establishment and operation of the National Synthesis Center for Emergence in the Molecular and Cellular Sciences (NCEMS) at…

Airborne single-photon lidar system achieves high-resolution 3D imaging

Compact, low-power system opens doors for photon-efficient drone and satellite-based environmental monitoring and mapping. Researchers have developed a compact and lightweight single-photon airborne lidar system that can acquire high-resolution 3D…

Partners & Sponsors