The work was reported in two recent papers in Genome Research, published online on July 3 and Sept. 27.
“Our goal is to understand how regulatory information is encrypted and to learn which sequence variations contribute to medical risks,” says Andrew McCallion, Ph.D., associate professor of molecular and comparative pathobiology in the McKusick-Nathans Institute of Genetic Medicine at Hopkins.
“We give data to a computer and ‘teach it’ to distinguish between data that has no biological value versus data that has this or that biological value. It then establishes a set of rules, which allows it to look at new sets of data and apply what it learned. We’re basically sending our computers to school.”
These state-of-the-art “machine learning” techniques were developed by Michael Beer, Ph.D., assistant professor of biomedical engineering at the Johns Hopkins School of Medicine, and by Ivan Ovcharenko, Ph.D., at the National Center for Biotechnology Information. The researchers began both studies by creating “training sets” for their computers to “learn” from. These training sets were lists of DNA sequences taken from regions of the genome, called enhancers, that are known to increase the activity of particular genes in particular cells.
For the first of their studies, McCallion’s team created a training set of enhancer sequences specific to a particular region of the brain by compiling a list of 211 published sequences that had been shown, by various studies in mice and zebrafish, to be active in the development or function of that part of the brain.
For a second study, the team generated a training set through experiments of their own. They began with a purified population of mouse melanocytes, which are the skin cells that produce the pigment melanin that gives color to skin and absorbs harmful UV rays from the sun. The researchers used a technique called ChIP-seq (pronounced “chip seek”) to collect and sequence all of the pieces of DNA that were bound in those cells by special enhancer-binding proteins, generating a list of about 2,500 presumed melanocyte enhancer sequences.
Once the researchers had these two training sets for their computers, one specific to the brain and another to melanocytes, the computers were able to distinguish the features of the training sequences from the features of all other sequences in the genome, and create rules that defined one set from the other. Applying those rules to the whole genome, the computers were able to discover thousands of probable brain or melanocyte enhancer sequences that fit the features of the training sets.
In the brain study, the computers identified 40,000 probable brain enhancer sequences; for melanocytes, 7,500. Randomly testing a subset of each batch of sequences, the scientists found that more than 85 percent of the predicted enhancer sequences enhanced gene activity in the brain or in melanocytes, as expected, verifying the predictive power of their approach.
The researchers say that, in addition to identifying specific DNA sequences that control the genetic activity of a particular organ or cell type, these studies contribute to our understanding of enhancers in general and have validated an experimental approach that can be applied to many other biological questions as well.
Authors on the brain paper include Grzegorz Burzynski, Xylena Reed, Zachary Stine, Takeshi Matsui and Andrew McCallion from The Johns Hopkins University, and Leila Taher and Ivan Ovcharenko from the National Center for Biotechnology Information.
Authors on the melanocyte paper include David Gorkin, Dongwon Lee, Xylena Reed, Christopher Fletez-Brant, Seneca Bessling, Michael Beer and Andrew McCallion from The Johns Hopkins University, and Stacie Loftus and William Pavan from the National Human Genome Research Institute.
This work was supported by grants from the National Institute of Neurological Disorders and Stroke (NS062972), the National Human Genome Research Institute’s Intramural Research Program, the National Library of Medicine, the National Institute of General Medical Sciences (GM07814, GM071648), the National Science Foundation and the Searle Scholars Program.
Catherine Kolf | Source: Newswise Science News
Further information: www.jhmi.edu
Further Reports about: Biotechnology > DNA > DNA sequence > Genom > Genome Research > Human Genome Research > Human vaccine > Medicine > regulating > sequences > skin cell
More articles from Life Sciences:
New way to improve antibiotic production
18.06.2013 | Norwich BioScience Institutes
Missing enzyme linked to drug addiction
18.06.2013 | The Endocrine Society
... two engines aircraft project “Elektro E6”.
The countdown has been started for opening the gates again for the worldwide leading aviation and space event in Le Bourget, Paris from June 17th - 23rd, 2013.
EADCO & PC-Aero will present at the Paris Air Show in Hall H4 booth F-7 their new future aircraft and innovative project: ...
Siemens scientists have developed new kinds of ceramics in which they can embed transformers.
The new development allows power supply transformers to be reduced to one fifth of their current size so that the normally separate switched-mode power supply units of light-emitting diodes can be integrated into the module's heat sink.
The new technology was developed in cooperation with industrial and research partners who ...
Cheaper clean-energy technologies could be made possible thanks to a new discovery.
Led by Raymond Schaak, a professor of chemistry at Penn State University, research team members have found that an important chemical reaction that generates hydrogen from water is effectively triggered -- or catalyzed -- by a nanoparticle composed of nickel and phosphorus, two inexpensive elements that are abundant on Earth. ...
The Fraunhofer Institute for Laser Technology ILT generated a lot of interest at the LASER World of Photonics 2013 trade fair with its numerous industrial laser technology innovations.
Its highlights included beam sources and manufacturing processes for ultrashort laser pulses as well as ways to systematically optimize machining processes using computer simulations. There was even a specialist booth at the fair dedicated to the revolutionary technological potential of digital photonic production.
Now in its fortieth year, LASER World ...
It's not reruns of "The Jetsons", but researchers working at the National Institute of Standards and Technology (NIST) have developed a new microscopy technique that uses a process similar to how an old tube television produces a picture—cathodoluminescence—to image nanoscale features.
Combining the best features of optical and scanning electron microscopy, the fast, versatile, and high-resolution technique allows scientists to view surface and subsurface features potentially as small as 10 nanometers in size.
The new microscopy technique, described in the journal AIP Advances,* uses a beam of electrons to excite a specially ...
18.06.2013 | Materials Sciences
Artificial Sweetener a Potential Treatment for Parkinson's Disease
18.06.2013 | Health and Medicine
New way to improve antibiotic production
18.06.2013 | Life Sciences
International Symposium on Morphogenesis
14.06.2013 | Event News
ESMT Annual Forum: CEOs discuss “The Future of Jobs” with international academics and policymakers
13.06.2013 | Event News
Invitation: Mathematics for Industry and Society in the French Embassy Berlin, 04. - 05.07.2013
10.06.2013 | Event News