Forum for Science, Industry and Business

Sponsored by:     3M 
Search our Site:


New methods identify thousands of new DNA sequences missing from the human genome reference map

Findings suggest that new genome assemblies based solely on next-generation sequencing might miss many of these sites

Researchers have discovered 2,363 new DNA sequences corresponding to 730 regions on the human genome by using new approaches. These sequences represent segments of the genome that were not charted in the reference map of the human genome.

"A large portion of those sequences are either missing, fragmented or misaligned when compared to results from next-generation sequencing genome assemblies on the same samples," said Dr. Evan Eichler, senior author on the findings published April 19 in the advanced online edition of Nature Methods. Eichler is a University of Washington (UW) professor of genome sciences and an investigator with the Howard Hughes Medical Institute. "These findings suggest that new genome assemblies based solely on next-generation sequencing might miss many of these sites."

Dr. Jeffrey M. Kidd was lead author of the article, which described the new techniques the research team used to find some of the missing sequences.

Kidd headed the study while earning his Ph.D. at the University of Washington in the Eichler lab. Kidd is now a postdoctoral fellow at Stanford University.

"Over the past several years, the extent to which the structure of the genome varies among humans has become clearer. This variation suggested that there must be portions of the human genome where DNA sequences had yet to be discovered, annotated and characterized," he said "We hope that these sequences ultimately will be included as part of future releases of the reference human genome sequence."

The reference genome is a yardstick – or standard for comparison – for studies of human genetics.

The human reference genome was first created in 2001 and is updated every couple of years, Kidd explained. It's a mosaic of DNA sequences derived from several individuals. He went on to say that about 80 percent of the reference genome came from eight people. One of them actually accounts for more than 66 percent of the total.

Along with their collaborators at Agilent, the team designed ways to examine these newly identified sequences in a panel of people representing populations from around the world. The researchers found that, in some cases, the number of copies of these sequences varied from person to person.

The fact that a person can have one or more copies, or no copy at all, of a particular DNA sequence may account for why these sequences were missing from the reference genome. The researchers also found that some of these sequences were common or rare in different populations, depending on from which part of the globe their ancestors originated.

"Each segment of the reference genome is from a single person, and reflects the genome of that individual. If the donor sample was missing a sequence that many other people have, that sequence would not be represented in the reference genome." Kidd explained. "That is why some of the positions on the reference genome represent rare structural configurations or entirely omit sequences found in the majority of people." Kidd said that the study published in Nature Methods used information from nine individuals, representing various world populations, to search for and fill in some of the missing pieces.

By looking at genomes from seven kinds of animals, the researchers were also able to show that some of the newly identified DNA sequences appear to have been conserved during the evolution of mammals and man. The animals whose genomes were studied were chimpanzee, Bornean orangutan, Rhesus monkey, house mouse, Norway rat, dog, and horse.

"Some of the sequences were present in several different species, but were absent from the reference genome," Kidd said. "Some of the sequences present in several mammals actually correspond to sites of variations in humans – some people have retained a particular sequence, and others have lost it."

The researchers also developed a method to accurately genotype many of the newly found DNA sequences and created a way to look at variations in the number of copies of these sequences, thereby opening up regions of the human genome previously inaccessible to such studies.

"Scientists can now begin trying to understand the functional importance of these sequences and their variations," Kidd said.

The 1,000 Genomes Project (an international effort to fully sequence the genomes of a thousand anonymous individuals) and other genome studies are amassing massive amounts of data on DNA sequences that are then mapped to the reference genome, he added. Any study, he continued, that improves the completeness and quality of the reference genome assembly will thereby benefit these projects and lead to a fuller picture of the extent of human genomic variation.

The findings are published as "Characterization of missing human genome sequences and copy-number polymorphic insertions," in Nature Methods.

In addition to Kidd and Eichler, other researchers on the study were Nick Sampas, Paige Anderson, Anya Tsalenko, N. Alice Yamada, Peter Tsang, and Laurakay Bruhn, all of Agilent Laboratories, Santa Clara, Calif.; Francesca Antonacci, Hillary S. Hayden, Can Alkan, and Maika Malig, all of the University of Washington in Seattle; Tina Graves, Robert Fulton, Joelle Kallicki, and Richard K. Wilson, all of the Genome Sequencing Center at the Washington University School of Medicine in St. Louis; and Mario Ventura and Giuliana Giannuzzi of the Department of Genetics and Microbiology, University of Bari, Italy.

Kidd's work on this study was supported by a U.S. National Science Foundation Graduate Research Fellowship. The study was funded by a grant to Eichler from the National Institutes of Health entitled "Human Genome Structural Variation."

Leila Gray | EurekAlert!
Further information:

Further reports about: DNA DNA sequence Nature Immunology genome sequence human genome methods

More articles from Life Sciences:

nachricht Novel mechanisms of action discovered for the skin cancer medication Imiquimod
21.10.2016 | Technische Universität München

nachricht Second research flight into zero gravity
21.10.2016 | Universität Zürich

All articles from Life Sciences >>>

The most recent press releases about innovation >>>

Die letzten 5 Focus-News des innovations-reports im Überblick:

Im Focus: New 3-D wiring technique brings scalable quantum computers closer to reality

Researchers from the Institute for Quantum Computing (IQC) at the University of Waterloo led the development of a new extensible wiring technique capable of controlling superconducting quantum bits, representing a significant step towards to the realization of a scalable quantum computer.

"The quantum socket is a wiring method that uses three-dimensional wires based on spring-loaded pins to address individual qubits," said Jeremy Béjanin, a PhD...

Im Focus: Scientists develop a semiconductor nanocomposite material that moves in response to light

In a paper in Scientific Reports, a research team at Worcester Polytechnic Institute describes a novel light-activated phenomenon that could become the basis for applications as diverse as microscopic robotic grippers and more efficient solar cells.

A research team at Worcester Polytechnic Institute (WPI) has developed a revolutionary, light-activated semiconductor nanocomposite material that can be used...

Im Focus: Diamonds aren't forever: Sandia, Harvard team create first quantum computer bridge

By forcefully embedding two silicon atoms in a diamond matrix, Sandia researchers have demonstrated for the first time on a single chip all the components needed to create a quantum bridge to link quantum computers together.

"People have already built small quantum computers," says Sandia researcher Ryan Camacho. "Maybe the first useful one won't be a single giant quantum computer...

Im Focus: New Products - Highlights of COMPAMED 2016

COMPAMED has become the leading international marketplace for suppliers of medical manufacturing. The trade fair, which takes place every November and is co-located to MEDICA in Dusseldorf, has been steadily growing over the past years and shows that medical technology remains a rapidly growing market.

In 2016, the joint pavilion by the IVAM Microtechnology Network, the Product Market “High-tech for Medical Devices”, will be located in Hall 8a again and will...

Im Focus: Ultra-thin ferroelectric material for next-generation electronics

'Ferroelectric' materials can switch between different states of electrical polarization in response to an external electric field. This flexibility means they show promise for many applications, for example in electronic devices and computer memory. Current ferroelectric materials are highly valued for their thermal and chemical stability and rapid electro-mechanical responses, but creating a material that is scalable down to the tiny sizes needed for technologies like silicon-based semiconductors (Si-based CMOS) has proven challenging.

Now, Hiroshi Funakubo and co-workers at the Tokyo Institute of Technology, in collaboration with researchers across Japan, have conducted experiments to...

All Focus news of the innovation-report >>>



Event News

#IC2S2: When Social Science meets Computer Science - GESIS will host the IC2S2 conference 2017

14.10.2016 | Event News

Agricultural Trade Developments and Potentials in Central Asia and the South Caucasus

14.10.2016 | Event News

World Health Summit – Day Three: A Call to Action

12.10.2016 | Event News

Latest News

Resolving the mystery of preeclampsia

21.10.2016 | Health and Medicine

Stanford researchers create new special-purpose computer that may someday save us billions

21.10.2016 | Information Technology

From ancient fossils to future cars

21.10.2016 | Materials Sciences

More VideoLinks >>>