The new data analysis and visualization tool offers improved ease of use compared to similar tools, the researchers say, and could be readily adapted for use with existing data sets in a variety of disciplines.
The data analysis and visualization tool is a subset of the Test Matrix Tool (TMT), a multi-component system developed by GTRI for designing, executing and analyzing large-scale modeling and simulation data sets. The visualization capability offers a graphical user interface that provides both on-screen data-manipulation features like filters and the ability to see query results in the form of graphical images almost instantly.
“Data visualization supports data analysis by letting users pose data-related questions onscreen with ease and then view the answers in ways that go far beyond ordinary table formats,” said Edward Clarkson, a GTRI research scientist who is leading the data visualization work. “A picture can be worth a thousand numbers, because visualizing data in a graph allows us to see patterns that might not be apparent from purely numerical results.”
Development of the Test Matrix Tool and its components is being led by Greg Rohling, a GTRI principal research engineer. Rohling’s team developed the TMT to support modeling and simulation investigations into the effectiveness and optimization of numerous U.S. defense systems, including electronic warfare equipment used to protect military aircraft. The work is supported by the Warner Robins Air Logistics Center at Robins Air Force Base.
In developing a simulation test, Test Matrix Tool users can specify desired variations in input parameters using multiple data filters. The TMT system executes all possible combinations of those parameters, creating a test matrix. It then executes the simulations on a Sun/Oracle Grid Engine and stores the resulting simulation output data in a MySQL database.
At that point, TMT’s data analysis component, which includes the data visualization tool, helps users evaluate the often complex test results. By collating the test matrix input and output, the data analysis tools allow users to efficiently filter and visualize test matrix data.
The Test Matrix Tool is designed for use on personal computers. It works under the Linux and Microsoft Windows operating systems.
Some TMT capabilities, including the data analysis and visualization components, could be useful for scrutinizing information gathered in many disciplines, Clarkson said. He mentioned health care as one field where a multitude of existing data sets could be mined for new insights.
“For example, there’s an enormous amount of data out there on heart patients,” he said. “Our data tools could be used to investigate existing patient information and seek significant trends in the data.”
Clarkson explained that users would face the challenge of organizing legacy data sets into formats that the GTRI data analysis software can exploit. But that task, he added, is generally straightforward and can be performed with automated tools in many cases.
The data format required by the TMT tools, he explained, is not particularly complex. What’s needed is a standard database setup in which the information fields are organized into tabular formats. Moreover, any required metadata – special data that tell the system how to deal with a particular data set – would likely present few development issues.
Clarkson recently demonstrated the capabilities of the data analysis and visualization tool using an existing database: baseball statistics. This particular demonstration involved the use of 40 different data filters available onscreen; the TMT system allows for 300 or more such filters.
In a random query of the 46,000 National League players from the past, an onscreen graph unexpectedly revealed an interesting anomaly during the demonstration. The data indicate that players’ height and weight increased in every past decade except the 1920s and 1930s, when it stayed inexplicably flat.
“That’s the beauty of this kind of tool – it can find the unknown unknowns,” Clarkson observed. “Details show up in graphs that aren’t obvious when you’re looking at just the numbers.”
The TMT data visualization tool, he explained, bears some similarities to the data filtering features found on some websites. For example, many shopping sites let users search for products by using filters to select desired qualities such as size, color and brand name.
However, Clarkson said, TMT’s capabilities are considerably more advanced. Whereas commercial systems stop at the filtering stage, the TMT data analysis tools allow fundamental manipulation of the data. Using filters, investigators can transform the data mathematically, a process that makes unique insights and discoveries possible.
“Data analysis and visualization are great for finding many things you want to know,” Clarkson said. “But another real advantage is that they can detect what you perhaps don’t want to know – the bugs and the anomalies -- the things that just aren’t right and have to be fixed.”Research News & Publications Office
Writer: Rick Robinson
John Toon | Newswise Science News
Cutting edge research for the industries of tomorrow – DFKI and NICT expand cooperation
21.03.2017 | Deutsches Forschungszentrum für Künstliche Intelligenz GmbH, DFKI
Molecular motor-powered biocomputers
20.03.2017 | Technische Universität Dresden
Astronomers from Bonn and Tautenburg in Thuringia (Germany) used the 100-m radio telescope at Effelsberg to observe several galaxy clusters. At the edges of these large accumulations of dark matter, stellar systems (galaxies), hot gas, and charged particles, they found magnetic fields that are exceptionally ordered over distances of many million light years. This makes them the most extended magnetic fields in the universe known so far.
The results will be published on March 22 in the journal „Astronomy & Astrophysics“.
Galaxy clusters are the largest gravitationally bound structures in the universe. With a typical extent of about 10 million light years, i.e. 100 times the...
Researchers at the Goethe University Frankfurt, together with partners from the University of Tübingen in Germany and Queen Mary University as well as Francis Crick Institute from London (UK) have developed a novel technology to decipher the secret ubiquitin code.
Ubiquitin is a small protein that can be linked to other cellular proteins, thereby controlling and modulating their functions. The attachment occurs in many...
In the eternal search for next generation high-efficiency solar cells and LEDs, scientists at Los Alamos National Laboratory and their partners are creating...
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are less stable. Now researchers at the Technical University of Munich (TUM) have, for the first time ever, produced a composite material combining silicon nanosheets and a polymer that is both UV-resistant and easy to process. This brings the scientists a significant step closer to industrial applications like flexible displays and photosensors.
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are...
Enzymes behave differently in a test tube compared with the molecular scrum of a living cell. Chemists from the University of Basel have now been able to simulate these confined natural conditions in artificial vesicles for the first time. As reported in the academic journal Small, the results are offering better insight into the development of nanoreactors and artificial organelles.
Enzymes behave differently in a test tube compared with the molecular scrum of a living cell. Chemists from the University of Basel have now been able to...
20.03.2017 | Event News
14.03.2017 | Event News
07.03.2017 | Event News
23.03.2017 | Life Sciences
23.03.2017 | Power and Electrical Engineering
23.03.2017 | Earth Sciences