Despite being led and funded by a private company, Mars Inc., Cacao Genome Database scientists say one of their chief concerns has been making sure the Theobroma cacao genome data was published for all to see -- especially cacao farmers and breeders in West Africa, Asia and South America, who can use genetic information to improve their planting stocks and protect their often-fragile incomes.
"When you have to wait three or more years for a tree you plant to bear the beans you sell, you want as much information as possible about the seedlings you're planting," said Keithanne Mockaitis, IU Center for Genomics and Bioinformatics (CGB) sequencing director and IU project leader. "We expect this information will positively impact some of the poorest regions in the world, where tropical tree crops are grown. Making the genome data public further enables breeders, farmers and researchers around the world to use a common set of tools, and to share information that will help them fight the spread of disease in their crops."
Mockaitis, a biochemist-turned-genomicist, joined the project in early 2009, and quickly set to work with her collaborators to tackle the challenge of sequencing and accurately pasting together the approximately 400 million base pairs of the tree's genome. Mockaitis' Cacao Genome Group partners at the U.S. Department of Agriculture's Subtropical Horticulture Research Station in Miami sent samples to Bloomington, and these were prepared and sequenced in a redundant manner by her sequencing team in the CGB genomics laboratory. Sequence of some of the same material was generated using additional methods in laboratories of the USDA Agricultural Research Service (USDA-ARS) and at the National Center for Genome Resources in Santa Fe, N.M.
Raw data were then sent to HudsonAlpha Biotechnology Institute, a partner of the U.S. Department of Energy-funded Joint Genome Institute, for assembly. Other important datasets generated by Mockaitis' group were not the sequences of the DNA itself, but of the RNA, or transcripts produced in different tissues of the tree. Transcript sequences reveal which genes are expressed (turned on).
Finally, IU Bloomington Department of Biology scientist Don Gilbert analyzed both the genome and transcriptome sequences and generated the annotations that point to the locations in which each active gene and its components (exons and introns) reside.
"The final number of genes is still being counted and validated, but we currently estimate the cacao plant has about 35,000 genes," Mockaitis said.
That's a typical gene number for flowering plants whose genomes have thus far been sequenced. Humans have approximately 30,000 genes. Rice has about 40,000.
Since its inception about 11 years ago, the CGB has been involved in dozens of different projects that address the workings of different species' genomes with the use of high-throughput technologies.
"Cacao is something of a first for us," Mockaitis said. "This is the largest genome the CGB has sequenced to date. As a group we now have more experience and more resources to take on a wider variety of projects."
Mockaitis says the relative efficiency of the project so far has been due to Mars' support of the academic and non-profit contributing laboratories.
"We've benefited from having a collegial group of researchers, from the USDA-ARS and a variety of genomics-focused laboratories, that each bring different scientific expertise to the table to complete this genome. It's also been particularly inspiring to see West African cacao researchers come to some of our meetings -- they listen to us talk about the esoteric technologies we're using, and we know that they'll soon go to work and start benefitting from the data. That's a rare treat for an academic researcher."
Mockaitis was first introduced to this project through Roche Diagnostics, based in Indianapolis, which owns the 454 Sequencing technology. Her group had developed improved methods for the sequencing of transcripts (active gene products, above), and was asked to contribute some data to the project. Since then the IU CGB has been able to contribute to the sequence of the genome itself as well.
Unlike some other food products, such as corn or wheat, which are often grown on large, industrial farms, cacao is almost exclusively grown on small farms. There are about 6.5 million chocolate farmers around the world, primarily in West Africa, northern South America, and Southeast Asia. The United States produces virtually no chocolate on its own, instead opting to engage cacao-growing countries with economic policies that support the production and trade of what may be the world's most popular food.
"Genome sequencing helps eliminate much of the guess-work of traditional crop cultivation," said Howard-Yana Shapiro, global staff officer of plant science and research at Mars Inc. "Cocoa is what some researchers describe as an 'orphan crop' because it has been the subject of little agricultural research compared to corn, wheat and rice. This effort, which will allow fast and accurate traditional breeding, is about applying the best of what science has to offer in taking an under-served crop and under-served population and giving them both the chance to flourish."
Mockaitis says she hopes the project will have a positive impact on the farmers' lives and livelihoods.
"It is an export crop that can reduce poverty," she said. "I believe the work our groups have done will eventually help small farmers stay in business over time, because improved breeding programs based on reliable genome data will give them plants naturally equipped to fight off disease and to thrive in their specific location. This will lead to more sustainable crops and of course a more stable chocolate supply for all of us -- pretty important!"
IU's participation in the project was wholly funded by Mars Inc. Mars is a privately held company that produces a number of chocolate products, including M&Ms. Computing and data transmission were accomplished using the NSF-funded TeraGrid, a national resource co-maintained by IU in Indianapolis.
The Cacao Genome Database is a consortium of academic, government and industry partners, including Mars Inc., the U.S. Department of Agriculture, Indiana University, Clemson University, Washington State University, the National Center for Genome Resources, the nonprofit PIPRA (Public Intellectual Property Resources for Agriculture), and the HudsonAlpha Institute.
Keithanne Mockaitis led IU's participation in the project, which continues this year to add important details to the cacao genome. Other IU contributors are Research Associates Zach Smith and James Ford, who perform sequencing in the laboratory using advanced technologies, and data analysts Ram Podicheti and Justin Choi, all of the CGB. Bioinformaticist Don Gilbert is a member of the IU Bloomington Department of Biology.
To speak with IU Project Leader Keithanne Mockaitis, please contact David Bricker, University Communications, at 812-856-9035 or firstname.lastname@example.org.
David Bricker | EurekAlert!
New cellular pathway helps explain how inflammation leads to artery disease
22.06.2018 | Cedars-Sinai Medical Center
Exposure to fracking chemicals and wastewater spurs fat cell development
22.06.2018 | Duke University
In a recent publication in the renowned journal Optica, scientists of Leibniz-Institute of Photonic Technology (Leibniz IPHT) in Jena showed that they can accurately control the optical properties of liquid-core fiber lasers and therefore their spectral band width by temperature and pressure tuning.
Already last year, the researchers provided experimental proof of a new dynamic of hybrid solitons– temporally and spectrally stationary light waves resulting...
Scientists from the University of Freiburg and the University of Basel identified a master regulator for bone regeneration. Prasad Shastri, Professor of...
Moving into its fourth decade, AchemAsia is setting out for new horizons: The International Expo and Innovation Forum for Sustainable Chemical Production will take place from 21-23 May 2019 in Shanghai, China. With an updated event profile, the eleventh edition focusses on topics that are especially relevant for the Chinese process industry, putting a strong emphasis on sustainability and innovation.
Founded in 1989 as a spin-off of ACHEMA to cater to the needs of China’s then developing industry, AchemAsia has since grown into a platform where the latest...
The BMBF-funded OWICELLS project was successfully completed with a final presentation at the BMW plant in Munich. The presentation demonstrated a Li-Fi communication with a mobile robot, while the robot carried out usual production processes (welding, moving and testing parts) in a 5x5m² production cell. The robust, optical wireless transmission is based on spatial diversity; in other words, data is sent and received simultaneously by several LEDs and several photodiodes. The system can transmit data at more than 100 Mbit/s and five milliseconds latency.
Modern production technologies in the automobile industry must become more flexible in order to fulfil individual customer requirements.
An international team of scientists has discovered a new way to transfer image information through multimodal fibers with almost no distortion - even if the fiber is bent. The results of the study, to which scientist from the Leibniz-Institute of Photonic Technology Jena (Leibniz IPHT) contributed, were published on 6thJune in the highly-cited journal Physical Review Letters.
Endoscopes allow doctors to see into a patient’s body like through a keyhole. Typically, the images are transmitted via a bundle of several hundreds of optical...
13.06.2018 | Event News
08.06.2018 | Event News
05.06.2018 | Event News
22.06.2018 | Life Sciences
22.06.2018 | Physics and Astronomy
22.06.2018 | Life Sciences