The quality of entries in the world's largest open-access online encyclopedia depends on how authors collaborate, University of Arizona Professor Sudha Ram finds.
The patterns of collaboration between Wikipedia contributors have a direct effect on the data quality of an article, according to a new paper co-authored by a University of Arizona professor and graduate student.
Sudha Ram, a UA's Eller College of Management professor, co-authored the article with Jun Liu, a graduate student in the management information systems department (MIS). Their work in this area received a "Best Paper Award" at the Workshop on Information Technology and Systems held in conjunction with the International Conference on Information Systems, or ICIS.
"Most of the existing research on Wikipedia is at the aggregate level, looking at total number of edits for an article, for example, or how many unique contributors participated in its creation," said Ram, who is a McClelland Professor of MIS in the Eller College.
"What was missing was an explanation for why some articles are of high quality and others are not," she said. "We investigated the relationship between collaboration and data quality."
Wikipedia has an internal quality rating system for entries, with featured articles at the top, followed by A, B, and C-level entries. Ram and Liu randomly collected 400 articles at each quality level and applied a data provenance model they developed in an earlier paper.
"We used data mining techniques and identified various patterns of collaboration based on the provenance or, more specifically, who does what to Wikipedia articles," Ram says. "These collaboration patterns either help increase quality or are detrimental to data quality."
Ram and Liu identified seven specific roles that Wikipedia contributors play.
Starters, for example, create sentences but seldom engage in other actions. Content justifiers create sentences and justify them with resources and links. Copy editors contribute primarily though modifying existing sentences. Some users – the all-round contributors – perform many different functions.
"We then clustered the articles based on these roles and examined the collaboration patterns within each cluster to see what kind of quality resulted," Ram said. "We found that all-round contributors dominated the best-quality entries. In the entries with the lowest quality, starters and casual contributors dominated."
To generate the best-quality entries, she says, people in many different roles must collaborate. Ram and Liu suggest that the results of this study should spark the design of software tools that can help improve quality.
"A software tool could prompt contributors to justify their insertions by adding links," she said, "and down the line, other software tools could encourage specific role setting and collaboration patterns to improve overall quality."
The impetus behind the paper came from Ram's involvement in UA's $50 million iPlant Collaborative, which is funded by the National Science Foundation and aims to unite the international scientific community around solving plant biology's "grand challenge" questions. Ram's role as a faculty advisor is to develop a cyberinfrastructure to facilitate collaboration.
"We initially suggested wikis for this, but we faced a lot of resistance," she said. Scientists expressed concerns ranging from lack of experience using the wikis to lack of incentive.
"We wondered how we could make people collaborate," Ram said. "So we looked at the English version of Wikipedia. There are more than three million entries, and thousands of people contribute voluntarily on a daily basis."
The results of this research have helped guide recommendations to the iPlant collaborators.
"If we want scientists to be collaborative," Ram said, "we need to assign them to specific roles and motivate them to police themselves and justify their contributions."
Liz Warren-Pederson | EurekAlert!
The plastic brain: Better connectivity of brain regions with training
02.07.2018 | Leibniz-Institut für Wissensmedien
Arguments, Emotions, and News distribution in social media - Leibniz-WissenschaftsCampus Tübingen
04.05.2018 | Leibniz-Institut für Wissensmedien
Scientists develop first tool to use machine learning methods to compute flow around interactively designable 3D objects. Tool will be presented at this year’s prestigious SIGGRAPH conference.
When engineers or designers want to test the aerodynamic properties of the newly designed shape of a car, airplane, or other object, they would normally model...
Researchers from TU Graz and their industry partners have unveiled a world first: the prototype of a robot-controlled, high-speed combined charging system (CCS) for electric vehicles that enables series charging of cars in various parking positions.
Global demand for electric vehicles is forecast to rise sharply: by 2025, the number of new vehicle registrations is expected to reach 25 million per year....
Proteins must be folded correctly to fulfill their molecular functions in cells. Molecular assistants called chaperones help proteins exploit their inbuilt folding potential and reach the correct three-dimensional structure. Researchers at the Max Planck Institute of Biochemistry (MPIB) have demonstrated that actin, the most abundant protein in higher developed cells, does not have the inbuilt potential to fold and instead requires special assistance to fold into its active state. The chaperone TRiC uses a previously undescribed mechanism to perform actin folding. The study was recently published in the journal Cell.
Actin is the most abundant protein in highly developed cells and has diverse functions in processes like cell stabilization, cell division and muscle...
Scientists have discovered that the electrical resistance of a copper-oxide compound depends on the magnetic field in a very unusual way -- a finding that could help direct the search for materials that can perfectly conduct electricity at room temperatur
What happens when really powerful magnets--capable of producing magnetic fields nearly two million times stronger than Earth's--are applied to materials that...
The quality of materials often depends on the manufacturing process. In casting and welding, for example, the rate at which melts solidify and the resulting microstructure of the alloy is important. With metallic foams as well, it depends on exactly how the foaming process takes place. To understand these processes fully requires fast sensing capability. The fastest 3D tomographic images to date have now been achieved at the BESSY II X-ray source operated by the Helmholtz-Zentrum Berlin.
Dr. Francisco Garcia-Moreno and his team have designed a turntable that rotates ultra-stably about its axis at a constant rotational speed. This really depends...
08.08.2018 | Event News
27.07.2018 | Event News
25.07.2018 | Event News
14.08.2018 | Information Technology
14.08.2018 | Life Sciences
14.08.2018 | Life Sciences