The research team unraveled the specific regions of the parrots' genome using a new technology, single molecule sequencing, and fixing its flaws with data from older DNA-decoding devices. The team also decoded hard-to-sequence genetic material from corn and bacteria as proof of their new sequencing approach.
This male budgie from the Fort Worth Zoo is like the parrots Erich Jarvis uses to study vocal learning behaviors. Credit: Jerry Tillery.
The results of the study appeared online July 1 in the journal Nature Biotechnology.
Single molecule sequencing "got a lot of hype last year" because it generates long sequencing reads, "supposedly making it easier to assemble complex parts of the genome," said Duke University neurobiologist Erich Jarvis, a co-author of the study.
He is interested in the sequences that regulate parrots' imitation abilities because they could give neuroscientists information about the gene regions that control speech development in humans.
Jarvis began his project with collaborators by trying to piece together the genome regions with what are known as next-generation sequencers, which read chunks of 100 to 400 DNA base pairs at a time and then take a few days to assemble them into a draft genome. After doing the sequencing, the scientists discovered that the read lengths were not long enough to assemble the regulatory regions of some of the genes that control brain circuits for vocal learning.
University of Maryland computational biologists Adam Phillippy and Sergey Koren -- experts at assembling genomes -- heard about Jarvis's sequencing struggles at a conference and approached him with a possible solution of modifying the algorithms that order the DNA base pairs. But the fix was still not sufficient.
Last year, 1000 base-pair reads by Roch 454 became available, as did the single molecule sequencer by Pacific Biosciences. The Pacbio technology generates strands of 2,250 to 23,000 base pairs at a time and can draft an entire genome in about a day.
Jarvis and others thought the new technologies would solve the genome-sequencing challenges. Through a competition, called the Assemblathon, the scientists discovered that the Pacbio machine had trouble accurately decoding complex regions of the parrot, Melopsittacus undulates, genome. The machine had a high error rate, generating the wrong genetic letter at every fifth or sixth spot in a string of DNA. The mistakes made it nearly impossible to create a genome assembly with the very long reads, Jarvis said.
But with a team, including scientists from the DOE Genome Science Institute and Cold Spring Harbor in New York, Phillippy, Koren and Jarvis corrected the Pacbio sequencer's errors using shorter, more accurate codes from the next-generation devices. The fix reduces the single-molecule, or third-generation, sequencing machine's error rate from 15 percent to less than one-tenth of one percent.
"Finally we have been able to assemble the regulatory regions of genes, such as FoxP2 and egr1, that are of interest to us and others in vocal learning behavior," Jarvis said.
He explained that FoxP2 is a gene required for speech development in humans and vocal learning in birds that learn to imitate sounds, like songbirds and parrots. Erg1 is a gene that controls the brain's ability to reorganize itself based on new experiences.
By being able to decode and organize the DNA that regulates these regions, neuroscientists may be able to better understand what genetic mechanism causes birds to imitate and sing well. They may also be able to collect more information about genetic factors that affect a person's ability to learn how to communicate well and to speak, Jarvis said. He and his team plan to describe the biology of the parrotâs genetic code they sequenced in more detail in an upcoming paper.
Jarvis added that as more scientists use the hybrid sequencing approach, they could possibly decode complex, elusive genes linked to how cancer cells develop and to the sequences that control other brain functions.
Citation: Hybrid error correction and de novo assembly of single-molecule sequencing reads. Koren. S., et. al. Nature Biotechnology. Published online July 1, 2012. DOI: 10.1038/nbt.2280.
Ashley Yeager | EurekAlert!
For a chimpanzee, one good turn deserves another
27.06.2017 | Max-Planck-Institut für Mathematik in den Naturwissenschaften (MPIMIS)
New method to rapidly map the 'social networks' of proteins
27.06.2017 | Salk Institute
An international team of scientists has proposed a new multi-disciplinary approach in which an array of new technologies will allow us to map biodiversity and the risks that wildlife is facing at the scale of whole landscapes. The findings are published in Nature Ecology and Evolution. This international research is led by the Kunming Institute of Zoology from China, University of East Anglia, University of Leicester and the Leibniz Institute for Zoo and Wildlife Research.
Using a combination of satellite and ground data, the team proposes that it is now possible to map biodiversity with an accuracy that has not been previously...
Heatwaves in the Arctic, longer periods of vegetation in Europe, severe floods in West Africa – starting in 2021, scientists want to explore the emissions of the greenhouse gas methane with the German-French satellite MERLIN. This is made possible by a new robust laser system of the Fraunhofer Institute for Laser Technology ILT in Aachen, which achieves unprecedented measurement accuracy.
Methane is primarily the result of the decomposition of organic matter. The gas has a 25 times greater warming potential than carbon dioxide, but is not as...
Hydrogen is regarded as the energy source of the future: It is produced with solar power and can be used to generate heat and electricity in fuel cells. Empa researchers have now succeeded in decoding the movement of hydrogen ions in crystals – a key step towards more efficient energy conversion in the hydrogen industry of tomorrow.
As charge carriers, electrons and ions play the leading role in electrochemical energy storage devices and converters such as batteries and fuel cells. Proton...
Scientists from the Excellence Cluster Universe at the Ludwig-Maximilians-Universität Munich have establised "Cosmowebportal", a unique data centre for cosmological simulations located at the Leibniz Supercomputing Centre (LRZ) of the Bavarian Academy of Sciences. The complete results of a series of large hydrodynamical cosmological simulations are available, with data volumes typically exceeding several hundred terabytes. Scientists worldwide can interactively explore these complex simulations via a web interface and directly access the results.
With current telescopes, scientists can observe our Universe’s galaxies and galaxy clusters and their distribution along an invisible cosmic web. From the...
Temperature measurements possible even on the smallest scale / Molecular ruby for use in material sciences, biology, and medicine
Chemists at Johannes Gutenberg University Mainz (JGU) in cooperation with researchers of the German Federal Institute for Materials Research and Testing (BAM)...
19.06.2017 | Event News
13.06.2017 | Event News
13.06.2017 | Event News
27.06.2017 | Power and Electrical Engineering
27.06.2017 | Information Technology
27.06.2017 | Physics and Astronomy