More than a decade later, researchers are finding that with the advent of the latest sequencing technologies the terms "draft" and "finished" are no longer sufficient to describe the varying levels of genome sequence quality being produced. The quality issue is of particular concern for any researcher who wants to use the sequence, in order to know its integrity and reliability.
"In the past we've been limited to two options, requiring us and the other centers to come up with internal definitions," said DOE JGI metagenomics researcher Patrick Chain at Los Alamos National Laboratory (LANL), first author of the Science paper. "But these are not clear and they're not propagated to the databases to which we submit sequences. So when users try to download genomes they get data of unknown quality with no information, or a complete genome that they assume has been checked for missing-data errors."
Chain said that when he and the other organizers of the Sequencing, Finishing, Analysis in the Future meeting hosted by LANL first gathered in 2005, they were concerned by the varying quality of the new genomes being submitted to public archives . As the meeting organizers all represented major sequencing centers (and smaller groups as well), the genome projects standards group was initiated at LANL, stimulated by these concerns.
The six categories defined by the group include:"Standard draft," which is the minimum amount of information needed for submission to a public database;
"My hope is all the major genome centers and advanced genomics groups use the gradations that fit their needs," he said. "Some centers may want all six, while some may only want three, but as long as they keep them intact we are in good shape. Then, my hope is that the smaller genomics groups adopt the classes as written to help the rest of the scientific community know what they are generating and submitting."
Chain added that the process of coming up with the proposed standards was not exactly an easy task since all major centers "have different pipelines, different sequencing techniques, different internal standards". They also recognized that the attempt to develop a "one size fits all" set of standards is still a work in progress. The definitions provided in the Science paper are fairly flexible, designed to apply regardless of the genome project or sequencing technologies employed and to meet each group's needs.
"We do expect that a number of people will comment on these standards, and possibly expand on the categories," he said, "but we feel we've covered all the bases with these six categories."
Chain said the group plans to team with the Genomic Standards Consortium, a grassroots movement begun by scientists who were concerned about the need for data collection standards in genome projects. The group has also talked to public archives such as GenBank to append these proposed standards to GenBank entries so that researchers can tell if the sequences will be useful to them. "Standards are a major issue to be tackled in genomics right now," Chain said. "These proposals are guideposts meant to inform users and generators."
Other DOE JGI authors on the study include David Bruce, Phil Hugenholtz, Nikos Kyrpides, Alla Lapidus, Sam Pitluck and Jeremy Schmutz. Other collaborating institutions are the Sanger Institute and the HMP Jumpstart Consortium sequencing centers (Washington University School of Medicine, the Broad Institute, the J. Craig Venter Institute, and Baylor College of Medicine), as well as Michigan State University, the Ontario Institute for Cancer Research, National Center for Biotechnology Information, Seattle Children's Hospital and Research Institute, Emory GRA and the Naval Medical Research Center.
The U.S. Department of Energy Joint Genome Institute, supported by DOE's Office of Science, is committed to advancing genomics in support of DOE missions related to clean energy generation and environmental characterization and cleanup. DOE JGI, headquartered in Walnut Creek, Calif., provides integrated high-throughput sequencing and computational analysis that enable systems-based scientific approaches to these challenges. Follow DOE JGI on Twitter.
David Gilbert | EurekAlert!
Scientists spin artificial silk from whey protein
24.01.2017 | Deutsches Elektronen-Synchrotron DESY
Choreographing the microRNA-target dance
24.01.2017 | UT Southwestern Medical Center
A Swedish-German team of researchers has cleared up a key process for the artificial production of silk. With the help of the intense X-rays from DESY's...
For the first time ever, a cloud of ultra-cold atoms has been successfully created in space on board of a sounding rocket. The MAIUS mission demonstrates that quantum optical sensors can be operated even in harsh environments like space – a prerequi-site for finding answers to the most challenging questions of fundamental physics and an important innovation driver for everyday applications.
According to Albert Einstein's Equivalence Principle, all bodies are accelerated at the same rate by the Earth's gravity, regardless of their properties. This...
An important step towards a completely new experimental access to quantum physics has been made at University of Konstanz. The team of scientists headed by...
Yersiniae cause severe intestinal infections. Studies using Yersinia pseudotuberculosis as a model organism aim to elucidate the infection mechanisms of these...
Researchers from the University of Hamburg in Germany, in collaboration with colleagues from the University of Aarhus in Denmark, have synthesized a new superconducting material by growing a few layers of an antiferromagnetic transition-metal chalcogenide on a bismuth-based topological insulator, both being non-superconducting materials.
While superconductivity and magnetism are generally believed to be mutually exclusive, surprisingly, in this new material, superconducting correlations...
19.01.2017 | Event News
10.01.2017 | Event News
09.01.2017 | Event News
24.01.2017 | Physics and Astronomy
24.01.2017 | Life Sciences
24.01.2017 | Health and Medicine