The RIKEN Bioinformatics And Systems Engineering division (BASE) in Yokohama, a research group led by Tetsuro Toyoda, has developed an easy-to-use bioinformatics interface. The web-service-based tool, called Semantic-JSON, and the portal, BioLOD, integrate access to information held in genomics, proteomics, and other ‘omics’-based data repositories.
“Advances in life sciences increasingly depend upon cross-analysis and integration of diverse information from multiple large databases maintained on remote computer servers,” explains Toyoda. “The challenge is to facilitate data retrieval, integration and collaboration while maintaining database security.”
As a first step, various research organizations worldwide, including RIKEN, recently published 192 public and 190 private mammalian, plant and protein databases. The data are integrated by SciNetS.org, the Scientists’ Networking System. These databases contain more than 8.2 million individual data records.
Pioneering a new global trend, BASE provides ‘structured linked open data’ and private data via the newly developed BioLOD.org portal connecting with the World Wide Web Consortium (W3C) Linking Open Data initiative. These self-described data are interlinked using standard web technologies allowing automatic reading by computers, thereby making them more useful to researchers. The system facilitates information sharing and collaboration between researchers, but brings new challenges.
“The sheer amount of data contained in our biological data cloud outstripped the capacity of existing bioinformatics interfaces to cope with the complexity of researcher queries, motivating us to develop Semantic-JSON,” explains Toyoda.
Semantic-JSON has two major components. The secured, unified data repository integrates data meaningfully (or ‘semantically’, in computer parlance) from numerous sources. The web-based interface allows researchers to retrieve linked data seamlessly and securely using established bioinformatics programming languages and processing. Bioinformatics researchers can then use their specialized computational tools to analyze raw biological data (Fig. 1).
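To give a feel for what retrieving linked data can look like on the client side, here is a minimal Python sketch. It does not call the real service: the JSON layout, the record identifiers and the `follow_links` helper are all assumptions made up for illustration, not the actual Semantic-JSON response format or API. It only shows the general idea of parsing a JSON payload and walking the links between records.

```python
import json

# Hypothetical payload: this shape is an assumption for illustration only,
# not the actual Semantic-JSON response format.
SAMPLE_RESPONSE = """
{
  "gene:G1": {"label": "example gene", "links": {"encodes": ["protein:P1"]}},
  "protein:P1": {"label": "example protein", "links": {"annotated_in": ["db:D1"]}},
  "db:D1": {"label": "example database", "links": {}}
}
"""


def follow_links(records, start_id):
    """Return the IDs reachable from start_id, in breadth-first order.

    Hypothetical helper: each record is assumed to carry a "links" mapping
    from a relation name to a list of target record IDs.
    """
    seen = [start_id]
    queue = [start_id]
    while queue:
        current = queue.pop(0)
        for targets in records.get(current, {}).get("links", {}).values():
            for target in targets:
                if target not in seen:
                    seen.append(target)
                    queue.append(target)
    return seen


records = json.loads(SAMPLE_RESPONSE)
reachable = follow_links(records, "gene:G1")
print([records[record_id]["label"] for record_id in reachable])
```

In a real workflow the sample string would be replaced by the body of a response from the web service, and the traversal would follow whatever link structure that service actually returns.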
Databases already available through Semantic-JSON and BioLOD.org include the RIKEN Integrated Database of Mammals with 79 human and mouse omics databases, the RIKEN Integrated Database of Plants incorporating 30 similar databases for the plant species Arabidopsis thaliana, and the RIKEN Integrated Protein Database containing 18 databases.
Since December 2009, international researchers have successfully used the system to identify 28 million data relationships, generating some 4.5 terabytes of associated files. As of March 2011, around 134,000 programs from non-RIKEN researchers had accessed the server. Biological applications include genome design, DNA sequence processing, and the inference of phenotypes (biological characteristics) from genomic information.
“Our next goal is to develop and improve the system to increase its functionality and the usefulness of its linked open data to the worldwide biological community,” says Toyoda.
Kobayashi, N., Ishii, M., Takahashi, S., Mochizuki, Y., Matsushima, A. & Toyoda, T. Semantic-JSON: a lightweight web service interface for Semantic Web contents integrating multiple life science databases. Nucleic Acids Research published online 1 June, 2011 (doi: 10.1093/nar/gkr353).