Scientists in the new but fast-growing field of computational genomics are facing a similar dilemma. In recent decades, these researchers have begun to assemble the chemical blueprints of the DNA found in humans, animals, plants and microbes, unlocking a door that will likely lead to better healthcare and greatly expanded life-science knowledge. But a major obstacle now threatens the speedy movement of DNA’s secrets into research labs, two scholars in the field are warning.
This logjam has occurred, the researchers say, because a flood of unassembled genetic data is being produced much faster than current computers can turn it into useful information. That’s the premise of a new article, co-written by a Johns Hopkins bioinformatics expert and published in the July 2013 issue of IEEE Spectrum. The piece, titled “DNA and the Data Deluge,” was co-authored by Michael C. Schatz, an assistant professor of quantitative biology at Cold Spring Harbor Laboratory, in New York state; and Ben Langmead, an assistant professor of computer science in Johns Hopkins’ Whiting School of Engineering.
In their article, the authors trace the rapidly increasing speed and declining cost of machines called DNA sequencers, which chop extremely long strands of biochemical components into more manageable small segments. But, the authors point out, these sequencers do not yield important biological information that researchers “can read like a book. Instead, [they] generate something like an enormous stack of shredded newspapers, without any organization of the fragments. The stack is far too large to deal with manually, so the problem of sifting through all the fragments is delegated to computer programs.”
In other words, the sequencers produce the genetic jigsaw pieces, and a computer is needed to assemble the picture. Therein lies the problem, Schatz and Langmead say: improvements in these computer programs have not kept pace with the enhancements and widespread use of the sequencers that are cranking out huge amounts of data. The result is, the puzzle cannot be pieced together in a timely manner. “It’s a problem that threatens to hold back this revolutionary technology,” the authors say in their article. “Computing, not sequencing, is now the slower and more costly aspect of genomics research.”
The authors then detail possible computing solutions that could help erase this digital bottleneck. In his own research at Johns Hopkins, co-author Langmead is working on some of these remedies. “The battle is really taking place on two fronts,” he said. “We need algorithms that are more clever at solving these data issues, and we need to harness more computing power.”
An algorithm is a recipe or a series of steps—such as searching through data or doing math calculations—that a computer must complete in order to accomplish a task. “With cleverer algorithms,” Langmead said, “you can do more steps with a fixed amount of computing power and time—and get more work done.”
The Johns Hopkins researcher has also had extensive experience in the second digital battle zone: assembling more computing power. This can be accomplished by putting multiple computers to work on assembling the DNA jigsaw puzzle. The linked machines can be at a single location or at multiple sites connected over the internet through the approach known as cloud computing. For the latter option, Langmead said, scientists may be able to do their work more quickly by tapping into the huge computing centers run by companies such as Amazon and “renting” time on these systems.
Langmead said he and Schatz wrote the IEEE Spectrum article to call attention to a significant computing problem and to jumpstart efforts to address it. The magazine describes itself as the flagship publication of the IEEE, the world’s largest professional technology association. “We hope the people who read our article can contribute to some solutions and make the work of genomic scientist much easier,” he said.The article can be viewed online at:
Posted in Engineering, Medicine and Nursing, TechnologyOffice of Communications
Phil Sneiderman | EurekAlert!
When Air is in Short Supply - Shedding light on plant stress reactions when oxygen runs short
23.03.2017 | Institut für Pflanzenbiochemie
WPI team grows heart tissue on spinach leaves
23.03.2017 | Worcester Polytechnic Institute
Astronomers from Bonn and Tautenburg in Thuringia (Germany) used the 100-m radio telescope at Effelsberg to observe several galaxy clusters. At the edges of these large accumulations of dark matter, stellar systems (galaxies), hot gas, and charged particles, they found magnetic fields that are exceptionally ordered over distances of many million light years. This makes them the most extended magnetic fields in the universe known so far.
The results will be published on March 22 in the journal „Astronomy & Astrophysics“.
Galaxy clusters are the largest gravitationally bound structures in the universe. With a typical extent of about 10 million light years, i.e. 100 times the...
Researchers at the Goethe University Frankfurt, together with partners from the University of Tübingen in Germany and Queen Mary University as well as Francis Crick Institute from London (UK) have developed a novel technology to decipher the secret ubiquitin code.
Ubiquitin is a small protein that can be linked to other cellular proteins, thereby controlling and modulating their functions. The attachment occurs in many...
In the eternal search for next generation high-efficiency solar cells and LEDs, scientists at Los Alamos National Laboratory and their partners are creating...
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are less stable. Now researchers at the Technical University of Munich (TUM) have, for the first time ever, produced a composite material combining silicon nanosheets and a polymer that is both UV-resistant and easy to process. This brings the scientists a significant step closer to industrial applications like flexible displays and photosensors.
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are...
Enzymes behave differently in a test tube compared with the molecular scrum of a living cell. Chemists from the University of Basel have now been able to simulate these confined natural conditions in artificial vesicles for the first time. As reported in the academic journal Small, the results are offering better insight into the development of nanoreactors and artificial organelles.
Enzymes behave differently in a test tube compared with the molecular scrum of a living cell. Chemists from the University of Basel have now been able to...
20.03.2017 | Event News
14.03.2017 | Event News
07.03.2017 | Event News
23.03.2017 | Life Sciences
23.03.2017 | Power and Electrical Engineering
23.03.2017 | Earth Sciences