A team of geneticists at Los Alamos National Laboratory, together with a consortium of international researchers, has recently proposed a set of standards designed to elucidate the quality of publicly available genetic sequencing information. The new standards could eventually allow genetic researchers to develop vaccines more efficiently or help public health or security personnel more quickly respond to potential public-health emergencies.
In a recent issue of Science, Los Alamos geneticist Patrick Chain and colleagues presented six labels for genome sequence data that are, or will become, available in public databases rather than the two labels used today. The six labels would roughly characterize the completeness and accuracy—and consequently, the potential reliability—of genetic sequencing data. This is of great importance since researchers use such data on a daily basis for cross-referencing unknown genetic material with the genetic material of known organisms.
Every living organism with DNA has chromosomes containing the four molecular building blocks, or base pairs, represented by letters A, T, G, and C. One chromosome can contain millions of base pairs arranged like rungs on a ladder of DNA. The base pairs are arranged in sets of specific sequences that make up genes. These gene sequences can contain genetic instructions that help or harm an organism—for example by encoding enzymes that digest certain foods, or inducing cellular aberrations that give rise to certain diseases.
Genome researchers have catalogued genetic data from thousands of organisms and placed them in publicly available libraries. Researchers can use these libraries to crosscheck genetic data, for example when attempting to isolate an unknown public health threat, or to determine where a potentially helpful or harmful gene may be located on an organism's chromosome. For scientific fields such as biofuels research or environmental remediation, genetic data could help researchers determine whether microorganisms can efficiently break down plant matter to aid in ethanol production, or digest environmental contaminants like hydrocarbons.
However, because of the complexity of genetic data, genetic information in public libraries can range from very rough to very refined. In the past, genetic data has been classified either as "draft" or "finished," leaving a wide range of uncertainty about the potential accuracy of genetic data.
"In the past few years we've seen major advances in genetic sequencing technology, so we've seen an explosion in the amount of publicly available data," said Chain, who is lead author of the Science paper. "The amount of base-pair sequencing data generated each day is in the billions—orders of magnitude larger than what was generated a few years ago. Different sequencing technologies have different levels of accuracy. High degrees of uncertainty in a sequence can potentially lead a researcher down a wrong path that they could follow for a year or more. We now have a need for standards that will provide researchers with an unambiguous estimation of the quality of genetic sequence data."
Working with researchers from genome sequencing centers big and small—including the U.S. Department of Energy's Joint Genome Institute, the Sanger Institute, the Human Microbiome Project Jumpstart Consortium sequencing centers, Michigan State University, and the Ontario Institute for Cancer Research, among others—Chain and colleagues have proposed that sequence data be placed into one of six categories that augment the existing two categories. The six standards range from "standard draft sequence," representing minimum requirements for public submission, to a "finished sequence," the highest standard, which can be verified to contain only one sequencing error per 100,000 base pairs.
"My hope is all the major genome centers and advanced genomics groups use the gradations that fit their needs," said Chris Detter, LANL Genome Science Group Leader and Joint Genome Institute-LANL Center director. "Some centers may want all six, while some may only want three, but as long as they keep them intact, we are in good shape. Then, my hope is that the smaller genomics groups adopt the classes as written to help the rest of the scientific community know what they are generating and submitting."
Other DOE JGI authors on the Science paper include David Bruce, Phil Hugenholtz, Nikos Kyrpides, Alla Lapidus, Sam Pitluck, and Jeremy Schmutz. Other collaborating institutions are the Sanger Institute and the HMP Jumpstart Consortium sequencing centers (Washington University School of Medicine, the Broad Institute, the J. Craig Venter Institute, and Baylor College of Medicine), as well as Michigan State University, the Ontario Institute for Cancer Research, National Center for Biotechnology Information, Seattle Children's Hospital and Research Institute, Emory GRA, and the Naval Medical Research Center.
About Los Alamos National Laboratory (www.lanl.gov)
Los Alamos National Laboratory, a multidisciplinary research institution engaged in strategic science on behalf of national security, is operated by Los Alamos National Security, LLC, a team composed of Bechtel National, the University of California, The Babcock & Wilcox Company, and the Washington Division of URS for the Department of Energy's National Nuclear Security Administration.
Los Alamos enhances national security by ensuring the safety and reliability of the U.S. nuclear stockpile, developing technologies to reduce threats from weapons of mass destruction, and solving problems related to energy, environment, infrastructure, health, and global security concerns.
James E. Rickman | EurekAlert!
New photocatalyst speeds up the conversion of carbon dioxide into chemical resources
29.05.2017 | DGIST (Daegu Gyeongbuk Institute of Science and Technology)
Copper hydroxide nanoparticles provide protection against toxic oxygen radicals in cigarette smoke
29.05.2017 | Johannes Gutenberg-Universität Mainz
The world's highest gain high power laser amplifier - by many orders of magnitude - has been developed in research led at the University of Strathclyde.
The researchers demonstrated the feasibility of using plasma to amplify short laser pulses of picojoule-level energy up to 100 millijoules, which is a 'gain'...
Staphylococcus aureus is a feared pathogen (MRSA, multi-resistant S. aureus) due to frequent resistances against many antibiotics, especially in hospital infections. Researchers at the Paul-Ehrlich-Institut have identified immunological processes that prevent a successful immune response directed against the pathogenic agent. The delivery of bacterial proteins with RNA adjuvant or messenger RNA (mRNA) into immune cells allows the re-direction of the immune response towards an active defense against S. aureus. This could be of significant importance for the development of an effective vaccine. PLOS Pathogens has published these research results online on 25 May 2017.
Staphylococcus aureus (S. aureus) is a bacterium that colonizes by far more than half of the skin and the mucosa of adults, usually without causing infections....
Physicists from the University of Würzburg are capable of generating identical looking single light particles at the push of a button. Two new studies now demonstrate the potential this method holds.
The quantum computer has fuelled the imagination of scientists for decades: It is based on fundamentally different phenomena than a conventional computer....
An international team of physicists has monitored the scattering behaviour of electrons in a non-conducting material in real-time. Their insights could be beneficial for radiotherapy.
We can refer to electrons in non-conducting materials as ‘sluggish’. Typically, they remain fixed in a location, deep inside an atomic composite. It is hence...
Two-dimensional magnetic structures are regarded as a promising material for new types of data storage, since the magnetic properties of individual molecular building blocks can be investigated and modified. For the first time, researchers have now produced a wafer-thin ferrimagnet, in which molecules with different magnetic centers arrange themselves on a gold surface to form a checkerboard pattern. Scientists at the Swiss Nanoscience Institute at the University of Basel and the Paul Scherrer Institute published their findings in the journal Nature Communications.
Ferrimagnets are composed of two centers which are magnetized at different strengths and point in opposing directions. Two-dimensional, quasi-flat ferrimagnets...
24.05.2017 | Event News
23.05.2017 | Event News
22.05.2017 | Event News
29.05.2017 | Earth Sciences
29.05.2017 | Life Sciences
29.05.2017 | Physics and Astronomy