A new study finds that -- even in a field with clear standards and online databases -- the rate of public data archiving in cancer research is increasing only slowly. Furthermore, research studies in cancer and human subjects are less likely than other research studies to make their datasets available for reuse.
The results come from a study of patterns of research data availability conducted by Dr Heather Piwowar of the National Evolutionary Synthesis Center.
Data collected in scientific research is often useful for future studies by other investigators, but scientists have rarely made their raw research data widely available. Tools and initiatives are underway to encourage scientists to publicly archive their data. This analysis confirms there is still much room for improvement.
By querying the full text of the scientific literature through websites like Google Scholar and PubMed Central, Piwowar identified eleven thousand studies that collected a particular type of data about cellular activity, called gene expression microarray data. Only 45% of recent gene expression studies were found to have deposited their data in the public databases developed for this purpose. The rate of data publication has increased only slightly from 2007 to 2009. Data is shared least often from studies on cancer and human subjects: cancer studies make their data available for wide reuse half as often as similar studies outside cancer.
"It was disheartening to discover that studies on cancer and human subjects were least likely to make their data available. These data are surely some of the most valuable for reuse, to confirm, refute, inform and advance bench-to-bedside translational research," Piwowar said.
"We want as much scientific progress as we can get from our tax and charity dollars. This requires increased access to data resources. Data can be shared while maintaining patient privacy," Piwowar added, noting that patient re-identification is rarely an issue for gene expression microarray studies.
Most likely to share their data in public databases were investigators from Stanford University and those who published in the journal Physiological Genomics.
Scientist sometimes email each other to request datasets that aren't available online, but these requests often go unanswered or are denied by the original investigators. Publishing data in online data repositories is considered the best way to share data for future reuse.
Recent policies by the NSF seek to increase the amount of data disseminated from federally-funded research by requiring data management and dissemination plans in all new grant applications.
The findings were published July 13th in the open access journal PLoS ONE.
CITATION: Piwowar, H. (2011). "Who shares? Who doesn't? Factors associated with openly archiving raw research data." PLoS ONE 6(7): e18657. doi:18610.11371/journal.pone.0018657
In the spirit of the topic, the raw data behind this study are publicly available in the Dryad Digital Repository at http://dx.doi.org/10.5061/dryad.mf1sd.
The National Evolutionary Synthesis Center (NESCent) is a nonprofit science center dedicated to cross-disciplinary research in evolution. Funded by the National Science Foundation, NESCent is jointly operated by Duke University, The University of North Carolina at Chapel Hill, and North Carolina State University. For more information about research and training opportunities at NESCent, visit www.nescent.org.
Heather Piwowar | EurekAlert!
Multi-year study finds 'hotspots' of ammonia over world's major agricultural areas
17.03.2017 | University of Maryland
Diabetes Drug May Improve Bone Fat-induced Defects of Fracture Healing
17.03.2017 | Deutsches Institut für Ernährungsforschung Potsdam-Rehbrücke
The Institute of Semiconductor Technology and the Institute of Physical and Theoretical Chemistry, both members of the Laboratory for Emerging Nanometrology (LENA), at Technische Universität Braunschweig are partners in a new European research project entitled ChipScope, which aims to develop a completely new and extremely small optical microscope capable of observing the interior of living cells in real time. A consortium of 7 partners from 5 countries will tackle this issue with very ambitious objectives during a four-year research program.
To demonstrate the usefulness of this new scientific tool, at the end of the project the developed chip-sized microscope will be used to observe in real-time...
Astronomers from Bonn and Tautenburg in Thuringia (Germany) used the 100-m radio telescope at Effelsberg to observe several galaxy clusters. At the edges of these large accumulations of dark matter, stellar systems (galaxies), hot gas, and charged particles, they found magnetic fields that are exceptionally ordered over distances of many million light years. This makes them the most extended magnetic fields in the universe known so far.
The results will be published on March 22 in the journal „Astronomy & Astrophysics“.
Galaxy clusters are the largest gravitationally bound structures in the universe. With a typical extent of about 10 million light years, i.e. 100 times the...
Researchers at the Goethe University Frankfurt, together with partners from the University of Tübingen in Germany and Queen Mary University as well as Francis Crick Institute from London (UK) have developed a novel technology to decipher the secret ubiquitin code.
Ubiquitin is a small protein that can be linked to other cellular proteins, thereby controlling and modulating their functions. The attachment occurs in many...
In the eternal search for next generation high-efficiency solar cells and LEDs, scientists at Los Alamos National Laboratory and their partners are creating...
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are less stable. Now researchers at the Technical University of Munich (TUM) have, for the first time ever, produced a composite material combining silicon nanosheets and a polymer that is both UV-resistant and easy to process. This brings the scientists a significant step closer to industrial applications like flexible displays and photosensors.
Silicon nanosheets are thin, two-dimensional layers with exceptional optoelectronic properties very similar to those of graphene. Albeit, the nanosheets are...
20.03.2017 | Event News
14.03.2017 | Event News
07.03.2017 | Event News
28.03.2017 | Life Sciences
28.03.2017 | Information Technology
28.03.2017 | Physics and Astronomy