Despite being led and funded by a private company, Mars Inc., Cacao Genome Database scientists say one of their chief concerns has been making sure the Theobroma cacao genome data was published for all to see -- especially cacao farmers and breeders in West Africa, Asia and South America, who can use genetic information to improve their planting stocks and protect their often-fragile incomes.
"When you have to wait three or more years for a tree you plant to bear the beans you sell, you want as much information as possible about the seedlings you're planting," said Keithanne Mockaitis, IU Center for Genomics and Bioinformatics (CGB) sequencing director and IU project leader. "We expect this information will positively impact some of the poorest regions in the world, where tropical tree crops are grown. Making the genome data public further enables breeders, farmers and researchers around the world to use a common set of tools, and to share information that will help them fight the spread of disease in their crops."
Mockaitis, a biochemist-turned-genomicist, joined the project in early 2009, and quickly set to work with her collaborators to tackle the challenge of sequencing and accurately pasting together the approximately 400 million base pairs of the tree's genome. Mockaitis' Cacao Genome Group partners at the U.S. Department of Agriculture's Subtropical Horticulture Research Station in Miami sent samples to Bloomington, and these were prepared and sequenced in a redundant manner by her sequencing team in the CGB genomics laboratory. Sequence of some of the same material was generated using additional methods in laboratories of the USDA Agricultural Research Service (USDA-ARS) and at the National Center for Genome Resources in Santa Fe, N.M.
Raw data were then sent to HudsonAlpha Biotechnology Institute, a partner of the U.S. Department of Energy-funded Joint Genome Institute, for assembly. Other important datasets generated by Mockaitis' group were not the sequences of the DNA itself, but of the RNA, or transcripts produced in different tissues of the tree. Transcript sequences reveal which genes are expressed (turned on).
Finally, IU Bloomington Department of Biology scientist Don Gilbert analyzed both the genome and transcriptome sequences and generated the annotations that point to the locations in which each active gene and its components (exons and introns) reside.
"The final number of genes is still being counted and validated, but we currently estimate the cacao plant has about 35,000 genes," Mockaitis said.
That's a typical gene number for flowering plants whose genomes have thus far been sequenced. Humans have approximately 30,000 genes. Rice has about 40,000.
Since its inception about 11 years ago, the CGB has been involved in dozens of different projects that address the workings of different species' genomes with the use of high-throughput technologies.
"Cacao is something of a first for us," Mockaitis said. "This is the largest genome the CGB has sequenced to date. As a group we now have more experience and more resources to take on a wider variety of projects."
Mockaitis says the relative efficiency of the project so far has been due to Mars' support of the academic and non-profit contributing laboratories.
"We've benefited from having a collegial group of researchers, from the USDA-ARS and a variety of genomics-focused laboratories, that each bring different scientific expertise to the table to complete this genome. It's also been particularly inspiring to see West African cacao researchers come to some of our meetings -- they listen to us talk about the esoteric technologies we're using, and we know that they'll soon go to work and start benefitting from the data. That's a rare treat for an academic researcher."
Mockaitis was first introduced to this project through Roche Diagnostics, based in Indianapolis, which owns the 454 Sequencing technology. Her group had developed improved methods for the sequencing of transcripts (active gene products, above), and was asked to contribute some data to the project. Since then the IU CGB has been able to contribute to the sequence of the genome itself as well.
Unlike some other food products, such as corn or wheat, which are often grown on large, industrial farms, cacao is almost exclusively grown on small farms. There are about 6.5 million chocolate farmers around the world, primarily in West Africa, northern South America, and Southeast Asia. The United States produces virtually no chocolate on its own, instead opting to engage cacao-growing countries with economic policies that support the production and trade of what may be the world's most popular food.
"Genome sequencing helps eliminate much of the guess-work of traditional crop cultivation," said Howard-Yana Shapiro, global staff officer of plant science and research at Mars Inc. "Cocoa is what some researchers describe as an 'orphan crop' because it has been the subject of little agricultural research compared to corn, wheat and rice. This effort, which will allow fast and accurate traditional breeding, is about applying the best of what science has to offer in taking an under-served crop and under-served population and giving them both the chance to flourish."
Mockaitis says she hopes the project will have a positive impact on the farmers' lives and livelihoods.
"It is an export crop that can reduce poverty," she said. "I believe the work our groups have done will eventually help small farmers stay in business over time, because improved breeding programs based on reliable genome data will give them plants naturally equipped to fight off disease and to thrive in their specific location. This will lead to more sustainable crops and of course a more stable chocolate supply for all of us -- pretty important!"
IU's participation in the project was wholly funded by Mars Inc. Mars is a privately held company that produces a number of chocolate products, including M&Ms. Computing and data transmission were accomplished using the NSF-funded TeraGrid, a national resource co-maintained by IU in Indianapolis.
The Cacao Genome Database is a consortium of academic, government and industry partners, including Mars Inc., the U.S. Department of Agriculture, Indiana University, Clemson University, Washington State University, the National Center for Genome Resources, the nonprofit PIPRA (Public Intellectual Property Resources for Agriculture), and the HudsonAlpha Institute.
Keithanne Mockaitis led IU's participation in the project, which continues this year to add important details to the cacao genome. Other IU contributors are Research Associates Zach Smith and James Ford, who perform sequencing in the laboratory using advanced technologies, and data analysts Ram Podicheti and Justin Choi, all of the CGB. Bioinformaticist Don Gilbert is a member of the IU Bloomington Department of Biology.
To speak with IU Project Leader Keithanne Mockaitis, please contact David Bricker, University Communications, at 812-856-9035 or email@example.com.
David Bricker | EurekAlert!
Transport of molecular motors into cilia
28.03.2017 | Aarhus University
Asian dust providing key nutrients for California's giant sequoias
28.03.2017 | University of California - Riverside
The Institute of Semiconductor Technology and the Institute of Physical and Theoretical Chemistry, both members of the Laboratory for Emerging Nanometrology (LENA), at Technische Universität Braunschweig are partners in a new European research project entitled ChipScope, which aims to develop a completely new and extremely small optical microscope capable of observing the interior of living cells in real time. A consortium of 7 partners from 5 countries will tackle this issue with very ambitious objectives during a four-year research program.
To demonstrate the usefulness of this new scientific tool, at the end of the project the developed chip-sized microscope will be used to observe in real-time...
Astronomers from Bonn and Tautenburg in Thuringia (Germany) used the 100-m radio telescope at Effelsberg to observe several galaxy clusters. At the edges of these large accumulations of dark matter, stellar systems (galaxies), hot gas, and charged particles, they found magnetic fields that are exceptionally ordered over distances of many million light years. This makes them the most extended magnetic fields in the universe known so far.
The results will be published on March 22 in the journal „Astronomy & Astrophysics“.
Galaxy clusters are the largest gravitationally bound structures in the universe. With a typical extent of about 10 million light years, i.e. 100 times the...
Researchers at the Goethe University Frankfurt, together with partners from the University of Tübingen in Germany and Queen Mary University as well as Francis Crick Institute from London (UK) have developed a novel technology to decipher the secret ubiquitin code.
Ubiquitin is a small protein that can be linked to other cellular proteins, thereby controlling and modulating their functions. The attachment occurs in many...
In the eternal search for next generation high-efficiency solar cells and LEDs, scientists at Los Alamos National Laboratory and their partners are creating...
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are less stable. Now researchers at the Technical University of Munich (TUM) have, for the first time ever, produced a composite material combining silicon nanosheets and a polymer that is both UV-resistant and easy to process. This brings the scientists a significant step closer to industrial applications like flexible displays and photosensors.
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are...
20.03.2017 | Event News
14.03.2017 | Event News
07.03.2017 | Event News
28.03.2017 | Life Sciences
28.03.2017 | Information Technology
28.03.2017 | Physics and Astronomy