The explosive growth of genomic and proteomic data has ushered in a new era of molecular medicine in which cancer detection, diagnosis and treatment are tailored to each individual's molecular profile.
Developed by May Dongmei Wang and her team in the Wallace H. Coulter Department of Biomedical Engineering at Georgia Tech and Emory University, the programs -- caCORRECT and omniBioMarker -- remove noise and artifacts, and identify and validate biomarkers from microarray data. Funding to develop the programs was provided by the National Institutes of Health, the Georgia Cancer Coalition, Microsoft Research and Hewlett-Packard.
caCORRECT -- chip artifact CORRECTion -- is a software program that improves the quality of collected microarray data, ultimately leading to improved biomarker selection. Widely used Affymetrix microarrays contain thousands of probes, each including a 25-oligo sequence, which are used to detect mRNA expression levels.
"Once someone has collected microarray data, it is important to run quality control on it and remove any problematic points of data that could highlight incorrect biomarkers when analyzed," explained Wang, who is also director of the biocomputing and bioinformatics core in the Emory-Georgia Tech National Cancer Institute Center for Cancer Nanotechnology Excellence (CCNE).
Since each microarray chip contains thousands of spots, it is easy for a few spots to become marred by artifacts and noise. These unusable portions are typically the result of experimental variations by different laboratory technicians or errors that create scratches, edge effects and bubble effects on the data.
caCORRECT removes the noise and artifacts from the data, while retaining high-quality genes on the array. The software can also effectively recover lost information that has been obscured by artifacts.
In collaboration with Andrew N. Young, an associate professor in pathology and laboratory medicine at Emory University School of Medicine and clinical laboratory director at Grady Health System, Wang and graduate students Todd Stokes, Martin Ahrens and Richard Moffitt validated the caCORRECT software. A large-scale survey of public data and data from Young's laboratory demonstrated the ability of caCORRECT to assess and improve the quality of a wide array of datasets.
"caCORRECT is a quality assurance tool that allows researchers to utilize and trust imperfect experimental microarray data that they spent a tremendous amount of time and money to generate," added Wang. "caCORRECT improves the downstream analysis of microarray data and should be used before conducting biomarker selection, therapeutic target studies, or pathway analysis studies in bioinformatics and systems biology."
Once the quality of the data is assured with caCORRECT, researchers can use the caBIG-certified omniBioMarker software to identify and validate biomarkers from the high-throughput gene expression data.
Candidate cancer biomarkers are typically genes expressed at different levels in cancer patients compared to healthy subjects. omniBioMarker searches these groups of patient data for genes with the highest potential for accurately determining whether a patient has cancer. However, because individual genes are not expressed independently, the software also identifies groups of genes that act in concert.
The advantage of the omniBioMarker software is that it fine-tunes biomarker selection to a particular dataset or clinical problem based on prior biological knowledge. It also applies unique analysis parameters for each specific clinical problem. The parameters are optimal when the software selects genes that are known to be relevant biomarkers based on clinical observations and laboratory experiments available in literature and public databases. Then the software finds new potential biomarkers for experimental validation.
Wang, graduate student John Phan and Young tested the ability of the software to identify biomarkers in clinical renal cancer microarray data. The researchers selected renal cancer for study because it has several distinct subtypes, which can appear in the same person in varying degrees and must be treated according to the diagnosed subtype to maximize treatment success. The results indicate that integrating prior laboratory and clinical knowledge with the microarray data improves biomarker selection.
"Using omniBioMarker to create an optimal metric for ranking and identifying novel biomarkers reduces the number of false discoveries, increases the number of true discoveries, reduces the required time for validation and increases the overall efficiency of the process," noted Wang.
Since receiving caBIG silver-level compatibility certification for caCORRECT and omniBioMarker, Wang and her team have been working on getting two more software programs certified -- Q-IHC, a tool that analyzes and quantifies multi-spectral images such as quantum dot-stained histopathological images, and omniVisGrid, a grid-based tool that visualizes data and analysis processes of microarrays, biological pathways and clinical outcomes.
This work was funded by grant numbers R01CA108468, P20GM072069 and U54CA119338 from the National Institutes of Health (NIH). The content is solely the responsibility of the principal investigator and does not necessarily represent the official view of the NIH.
Abby Vogel | EurekAlert!
Information integration and artificial intelligence for better diagnosis and therapy decisions
24.05.2017 | Fraunhofer MEVIS - Institut für Bildgestützte Medizin
World's thinnest hologram paves path to new 3-D world
18.05.2017 | RMIT University
Physicists from the University of Würzburg are capable of generating identical looking single light particles at the push of a button. Two new studies now demonstrate the potential this method holds.
The quantum computer has fuelled the imagination of scientists for decades: It is based on fundamentally different phenomena than a conventional computer....
An international team of physicists has monitored the scattering behaviour of electrons in a non-conducting material in real-time. Their insights could be beneficial for radiotherapy.
We can refer to electrons in non-conducting materials as ‘sluggish’. Typically, they remain fixed in a location, deep inside an atomic composite. It is hence...
Two-dimensional magnetic structures are regarded as a promising material for new types of data storage, since the magnetic properties of individual molecular building blocks can be investigated and modified. For the first time, researchers have now produced a wafer-thin ferrimagnet, in which molecules with different magnetic centers arrange themselves on a gold surface to form a checkerboard pattern. Scientists at the Swiss Nanoscience Institute at the University of Basel and the Paul Scherrer Institute published their findings in the journal Nature Communications.
Ferrimagnets are composed of two centers which are magnetized at different strengths and point in opposing directions. Two-dimensional, quasi-flat ferrimagnets...
An Australian-Chinese research team has created the world's thinnest hologram, paving the way towards the integration of 3D holography into everyday...
In the race to produce a quantum computer, a number of projects are seeking a way to create quantum bits -- or qubits -- that are stable, meaning they are not much affected by changes in their environment. This normally needs highly nonlinear non-dissipative elements capable of functioning at very low temperatures.
In pursuit of this goal, researchers at EPFL's Laboratory of Photonics and Quantum Measurements LPQM (STI/SB), have investigated a nonlinear graphene-based...
24.05.2017 | Event News
23.05.2017 | Event News
22.05.2017 | Event News
24.05.2017 | Physics and Astronomy
24.05.2017 | Physics and Astronomy
24.05.2017 | Event News