To learn more about the community of symbiotic microbes that outnumber our own somatic and germ cells by a 10:1 ratio, the National Institutes of Health (NIH) in 2008 launched the Human Microbiome Project (HMP) - a microbiome is the full complement of microorganisms populating a supraorganism.
The goal of the HMP is to sequence the genomes of 1,000 or more of these microbial species and assemble the information in a "project catalog" as a reference for future investigations. The project catalog is housed at the HMP Data Acquisition and Coordination Center (DACC), which was created and is maintained by researchers with the U.S. Department of Energy's Lawrence Berkeley National Laboratory (Berkeley Lab).
"The HMP project catalog is a unique worldwide resource," says molecular biologist Nikos Kyrpides of Berkeley Lab's Genomics Division, who heads the Genome Biology and Metagenomics Programs for the Joint Genome Institute (JGI) and is the co-principal investigator of the DACC. "It has a central role in the HMP, not only in maintaining the list and status of over 1,400 individual human microbiome projects, but also as a data managements system for the metadata associated with these projects, such as information on the microbial isolation sites and the sites in the human body where these microbes can be found, and information on the phenotypic properties of these microbes."
At JGI, Kyrpides oversees projects such as GenePRIMP, a highly rated quality control program for genome sequencing, and GOLD, the Genomes On-Line Database. GenePRIMP stands for "Gene PRediction IMprovement Pipeline, and it consists of a series of computational units that can be used to significantly improve the overall quality of the predicted genes in any sequenced genome. The results identify gene-calling errors such as potentially incorrect gene start and end positions, large overlaps between genes, and fragmented or missed genes. GOLD provides comprehensive information on genome sequencing projects, including metagenomes and metadata from around the world. The HMP project catalog is powered by the GOLD database and provides a specialized user interface by which the data stored in GOLD can be read.
The other co-principal investigator of the DACC is Victor Markowitz who heads Berkeley Lab's Biological Data Management and Technology Center in the Computational Research Division, and also serves as the Chief Informatics Officer and Associate Director at JGI. Markowitz oversees the development and maintenance of the Integrated Microbial Genomics with Microbiome samples (IMG/M) system, which provides comparative analysis tools for the study of metagenomes – the collective genetic material of a given microbiome. First released in 2006, IMG/M contains millions of annotated microbial gene sequences, recovered from wild varieties of microbial communities. IMG/M is now being applied to the HMP.
"Resources such as GenePRIMP, GOLD and IMG/M are among the best in the world when it comes to providing comparative analysis tools for microbial genomes and metagenomes," Markowitz says. "As the HMP moves forward, these resources will provide support for the annotation and analysis of HMP datasets, in particular via the metagenome annotation pipeline at JGI and a HMP specific version of the IMG/M system."
The first 178 reference microbial genomes have now been analyzed and catalogued by the HMP. The results were published in the journal Science in a paper titled, "A Catalog of Reference Genomes from the Human Microbiome."
In this paper, HMP researchers report comparing data from the sequenced reference genomes to human metagenomic data in the public domain to identify proteins, determine gene functionality and link metagenomic data to individual microbial species. From an analysis of 547,968 predicted proteins, the HMP researchers report 29,987 unique proteins, which suggests a far greater diversity in the human microbiome than previously suspected.
"The Science paper is a milestone in the human microbiome research with the release to the public of 178 finished or high quality draft genomes from organisms isolated from various sites in the human body," says Kyrpides. "It signals the beginning of a much larger effort that aims to provide a more comprehensive genetic catalog of the microbes living in the human body. The impact of understanding what is the normal microbial flora, what is its core genetic content, and how perturbations of the normal microbial flora of the human body can shift from protecting our bodies into causing diseases will eventually be enormous."
Kyrpides, Markowitz and their colleagues at the DACC are playing a critical role in fulfilling an NIH call for development of common sequencing and annotation standards that have not existed before. Lack of common language and a clearing house for genome data have been among the most daunting problems in genomics research.
Says Markowitz, "The greatest challenge ahead will be handling hundred of metagenomic datasets generated as part of the HMP, which will represent several orders of magnitude more data than the datasets presented in the current paper. We need to develop novel analysis and visualization methods to handle this massive increase in data."
Adds Kyrpides, "New sequencing technologies and our ability to generate orders of magnitude more data compared to only a year or two ago are changing the field entirely, and are mandating a social shift among the scientists involved to a more collaborative rather than competitive spirit. None of us can provide solutions alone any more, and joint efforts such as the HMP are the only way we'll succeed."
Other Berkeley Lab/JGI researchers with prominent roles in the HMP include Gary Andersen, Todd DeSantis, Amy Chen, Konstantinos Liolios, Amrita Pati and Konstantinos Mavrommatis.
Berkeley Lab is a U.S. Department of Energy (DOE) national laboratory located in Berkeley, California. It conducts unclassified scientific research and is managed by the University of California for the DOE Office of Science.
Lynn Yarris | EurekAlert!
Climate Impact Research in Hannover: Small Plants against Large Waves
17.08.2018 | Leibniz Universität Hannover
First transcription atlas of all wheat genes expands prospects for research and cultivation
17.08.2018 | Leibniz-Institut für Pflanzengenetik und Kulturpflanzenforschung
New design tool automatically creates nanostructure 3D-print templates for user-given colors
Scientists present work at prestigious SIGGRAPH conference
Most of the objects we see are colored by pigments, but using pigments has disadvantages: such colors can fade, industrial pigments are often toxic, and...
Scientists at the University of California, Los Angeles present new research on a curious cosmic phenomenon known as "whistlers" -- very low frequency packets...
Scientists develop first tool to use machine learning methods to compute flow around interactively designable 3D objects. Tool will be presented at this year’s prestigious SIGGRAPH conference.
When engineers or designers want to test the aerodynamic properties of the newly designed shape of a car, airplane, or other object, they would normally model...
Researchers from TU Graz and their industry partners have unveiled a world first: the prototype of a robot-controlled, high-speed combined charging system (CCS) for electric vehicles that enables series charging of cars in various parking positions.
Global demand for electric vehicles is forecast to rise sharply: by 2025, the number of new vehicle registrations is expected to reach 25 million per year....
Proteins must be folded correctly to fulfill their molecular functions in cells. Molecular assistants called chaperones help proteins exploit their inbuilt folding potential and reach the correct three-dimensional structure. Researchers at the Max Planck Institute of Biochemistry (MPIB) have demonstrated that actin, the most abundant protein in higher developed cells, does not have the inbuilt potential to fold and instead requires special assistance to fold into its active state. The chaperone TRiC uses a previously undescribed mechanism to perform actin folding. The study was recently published in the journal Cell.
Actin is the most abundant protein in highly developed cells and has diverse functions in processes like cell stabilization, cell division and muscle...
17.08.2018 | Event News
08.08.2018 | Event News
27.07.2018 | Event News
17.08.2018 | Physics and Astronomy
17.08.2018 | Information Technology
17.08.2018 | Life Sciences