Using a $2.1 million grant from the National Science Foundation, a group led by computer scientist and astrophysicist Alexander Szalay of Johns Hopkins' Institute for Data Intensive Engineering and Science is designing and developing such a tool, dubbed the Data-Scope.
Once built, the Data-Scope, which is actually a cluster of sophisticated computers capable of handling colossal sets of information, will enable the kind of data analysis tasks that simply are not otherwise possible today, said Szalay, the Alumni Centennial Professor in the Krieger School's Henry A. Rowland Department of Physics and Astronomy.
"Computer science has drastically changed the way we do science and the science that we do, and the Data-Scope is a crucial step in this process," Szalay said. "At this moment, the huge data sets are here, but we lack an integrated software and hardware infrastructure to analyze them. Data-Scope will bridge that gap."
Co-investigators on the Data-Scope project, all from Johns Hopkins, are Kenneth Church, chief scientist for the Human Language Technology Center of Excellence, a Department of Defense-funded center dedicated to advancing technology for the analysis of speech, text and document data; Andreas Terzis, associate professor in the Department of Computer Science at the Whiting School of Engineering; Sarah Wheelan, assistant professor of oncology bioinformatics in the School of Medicine; and Scott Zeger, professor of biostatistics in the Bloomberg School of Public Health and the university's vice provost for research.
Data-Scope will be able to handle 5 petabytes of data. That's the equivalent of 100 million four-drawer file cabinets filled with text. (Fifty petabytes would equal the entire written work of humankind, from the beginning of history until now, in all languages.)
The new apparatus will allow Szalay and a host of other Johns Hopkins researchers (not to mention those at other institutions, including universities and national laboratories such as Los Alamos in New Mexico and Oak Ridge in Tennessee) to conduct research directly in the database, which is where Szalay contends that more and more science is being done.
"The Data-Scope will allow us to mine out relationships among data that already exist, but that we can't yet handle, and to sift discoveries from what seems like an overwhelming flow of information," he said. "New discoveries will definitely emerge this way. There are relationships and patterns that we just cannot fathom buried in that onslaught of data. Data-Scope will tease these out."
According to Szalay, there are at least 20 research groups within Johns Hopkins that are grappling with data problems totaling 3 petabytes. (Three petabytes is equal to about 20 billion photos on Facebook.) Without Data-Scope, "they would have to wait years in order to analyze that amount of data," Szalay said.
The two-year NSF grant, to be supplemented with almost $1 million from Johns Hopkins, will underwrite the design and building of the new instrument and its first year of operation, expected to begin in May 2011. Szalay said that the range of material that the Data-Scope will handle will be "breathtakingly large, from genomics to ocean circulation, turbulence, astrophysics, environmental science, public health and beyond."
"There really is nothing like this at any university right now," Szalay said. "Such systems usually take many years to build up, but we are doing it much more quickly. It's similar to what Google is doing-of course on a thousand-times-larger scale than we are. This instrument will be the best in the academic world, bar none."
Zeger said he is excited about the research possibilities and collaborations that the new instrument will make possible.
"The NSF funding of a high-performance computing system, specially designed by Dr. Szalay and his team to solve large computational problems, will contribute to Johns Hopkins' remaining in the forefront of many areas, including biomedicine, where I work," he said. "The new genomic data are voluminous. Their analysis requires machines faster than are currently available. Dr. Szalay's machine will enable our biomedical and computational scientists to work together to solve problems that would have been beyond them otherwise."
Jonathan Bagger, vice provost for graduate and postdoctoral programs and special projects, said he believes that the Data-Scope positions Johns Hopkins to play a crucial role in the next revolution in science: data analysis.
"The Data-Scope is specially designed to bring large amounts of data literally under the microscope," he said. "By manipulating data in new ways, Johns Hopkins researchers will be able to advance their science in ways never before possible. I am excited that Johns Hopkins is in the forefront of this new field of inquiry: developing the calculus of the 21st century."
The instrument will be part of a new energy-efficient computing center that is being constructed in the basement of the Bloomberg Center for Physics and Astronomy on the Homewood campus. The house-sized room once served as a mission control center for the Far Ultraviolet Spectroscopic Explorer, a NASA satellite. This computing center is being built using a $1.3 million federal stimulus grant from the National Science Foundation.
Lisa De Nike | Newswise Science News
Next Generation Cryptography
20.03.2018 | Fraunhofer-Institut für Sichere Informationstechnologie SIT
TIB’s Visual Analytics Research Group to develop methods for person detection and visualisation
19.03.2018 | Technische Informationsbibliothek (TIB)
Satellites in near-Earth orbit are at risk due to the steady increase in space debris. But their mission in the areas of telecommunications, navigation or weather forecasts is essential for society. Fraunhofer FHR therefore develops radar-based systems which allow the detection, tracking and cataloging of even the smallest particles of debris. Satellite operators who have access to our data are in a better position to plan evasive maneuvers and prevent destructive collisions. From April, 25-29 2018, Fraunhofer FHR and its partners will exhibit the complementary radar systems TIRA and GESTRA as well as the latest radar techniques for space observation across three stands at the ILA Berlin.
The "traffic situation" in space is very tense: the Earth is currently being orbited not only by countless satellites but also by a large volume of space...
An international team of researchers has discovered a new anti-cancer protein. The protein, called LHPP, prevents the uncontrolled proliferation of cancer cells in the liver. The researchers led by Prof. Michael N. Hall from the Biozentrum, University of Basel, report in “Nature” that LHPP can also serve as a biomarker for the diagnosis and prognosis of liver cancer.
The incidence of liver cancer, also known as hepatocellular carcinoma, is steadily increasing. In the last twenty years, the number of cases has almost doubled...
In just a few weeks from now, the Chinese space station Tiangong-1 will re-enter the Earth's atmosphere where it will to a large extent burn up. It is possible that some debris will reach the Earth's surface. Tiangong-1 is orbiting the Earth uncontrolled at a speed of approx. 29,000 km/h.Currently the prognosis relating to the time of impact currently lies within a window of several days. The scientists at Fraunhofer FHR have already been monitoring Tiangong-1 for a number of weeks with their TIRA system, one of the most powerful space observation radars in the world, with a view to supporting the German Space Situational Awareness Center and the ESA with their re-entry forecasts.
Following the loss of radio contact with Tiangong-1 in 2016 and due to the low orbital height, it is now inevitable that the Chinese space station will...
Fraunhofer Institute for Organic Electronics, Electron Beam and Plasma Technology FEP, provider of research and development services for OLED lighting solutions, announces the founding of the “OLED Licht Forum” and presents latest OLED design and lighting solutions during light+building, from March 18th – 23rd, 2018 in Frankfurt a.M./Germany, at booth no. F91 in Hall 4.0.
They are united in their passion for OLED (organic light emitting diodes) lighting with all of its unique facets and application possibilities. Thus experts in...
A new scenario seeking to explain how Mars' putative oceans came and went over the last 4 billion years implies that the oceans formed several hundred million...
23.03.2018 | Event News
19.03.2018 | Event News
16.03.2018 | Event News
23.03.2018 | Materials Sciences
23.03.2018 | Agricultural and Forestry Science
23.03.2018 | Physics and Astronomy