Data mining Twitter “tweets” may produce a gold mine for two University of Cincinnati computer science students.
William Clifton and Alex Padgett have developed a web-based application called The Tweetographer that allows users to learn about events in their cities or neighborhoods. The app works by collecting tweets sent by large numbers of Twitter users and extracting information about events – parties, concerts, games, etc. – happening nearby. It’s like a real-time events guide.
The Tweetographer was the senior project for the pair, who are graduating during the 2011-12 academic year, Padgett in December and Clifton in June.
“We wanted to explore data mining, which is an important area of research in Computer Science, in the context of social media,” Padgett said. “Although the concept will work with many social media platforms, Twitter was the most accessible. Everything is out there in public domain, a giant pool of untapped data, tagged with latitude and longitude. It’s very precise and lends itself to so many uses.”
That broad utility created some difficulty for the developers as they tried to formulate a focused project.
“We realized that we could do all sorts of things with this data. We could add all sorts of functions, but we worked really hard to avoid ‘feature creep’ and decided to focus on events,” Clifton said.
The Tweetographer, in practice, answers a common question for socially active people: “What’s happening?” Since people who use Twitter often tweet about where they are going or what they want to do, The Tweetographer answers that question by listening in to the chatter. A user can get a sense of not only what is going on, but how popular various events are.
The application is so effective that it was initially overwhelmed the volume of data streaming in through millions of tweets in some large cities.
“Eventually we were able to come up with a solution for this with a kind of queuing system that let us handle a stream of that magnitude,” Clifton said.
Another obstacle was making sense of all the available data. Although Twitter offers upwards of 140 million tweets a day, they are not posted in a uniform format.
“So many people type in their own shorthand,” Padgett said.
The solution, according to Clifton, was to create a “thesaurus” of multiple Twitter synonyms.
“Do you know how many ways people type ‘Tuesday’?” Clifton said.
All of the technical obstacles needed to be overcome on a tight deadline – just six months from assignment to presentation.
“If we had a couple of years, we could come up with something a lot more sophisticated,” Padgett said. “Everyone is their own worst critic, and we had very high standards. We wanted to show an elegant, simple solution.”
The Tweetographer got an enthusiastic reception at its unveiling.
“It blew people’s minds,” Clifton said. “One skeptic, in particular, wanted to test us. He said, ‘If I tweet right now, it will show up,’ and we said yes. He tweeted, and it popped up onscreen right away.”
The future of The Tweetographer is yet to be written. Padgett and Clifton are making plans beyond graduation, yet still actively working on improving and evolving this project. Clifton thinks the “engine” developed for The Tweetographer has other useful applications, such as predicting election outcomes, or compiling product reviews.
“So much is out there,” Clifton said.
Greg Hand | EurekAlert!
High Number of Science Enthusiasts in Switzerland
05.02.2018 | Universität Zürich
Between filter bubbles, uneven visibility and transnationality
06.12.2017 | Schweizerischer Nationalfonds SNF
University of Connecticut researchers have created a biodegradable composite made of silk fibers that can be used to repair broken load-bearing bones without the complications sometimes presented by other materials.
Repairing major load-bearing bones such as those in the leg can be a long and uncomfortable process.
Study published in the journal ACS Applied Materials & Interfaces is the outcome of an international effort that included teams from Dresden and Berlin in Germany, and the US.
Scientists at the Helmholtz-Zentrum Dresden-Rossendorf (HZDR) together with colleagues from the Helmholtz-Zentrum Berlin (HZB) and the University of Virginia...
Novel highly efficient and brilliant gamma-ray source: Based on model calculations, physicists of the Max PIanck Institute for Nuclear Physics in Heidelberg propose a novel method for an efficient high-brilliance gamma-ray source. A giant collimated gamma-ray pulse is generated from the interaction of a dense ultra-relativistic electron beam with a thin solid conductor. Energetic gamma-rays are copiously produced as the electron beam splits into filaments while propagating across the conductor. The resulting gamma-ray energy and flux enable novel experiments in nuclear and fundamental physics.
The typical wavelength of light interacting with an object of the microcosm scales with the size of this object. For atoms, this ranges from visible light to...
Stable joint cartilage can be produced from adult stem cells originating from bone marrow. This is made possible by inducing specific molecular processes occurring during embryonic cartilage formation, as researchers from the University and University Hospital of Basel report in the scientific journal PNAS.
Certain mesenchymal stem/stromal cells from the bone marrow of adults are considered extremely promising for skeletal tissue regeneration. These adult stem...
In the fight against cancer, scientists are developing new drugs to hit tumor cells at so far unused weak points. Such a “sore spot” is the protein complex...
13.04.2018 | Event News
12.04.2018 | Event News
09.04.2018 | Event News
20.04.2018 | Physics and Astronomy
20.04.2018 | Interdisciplinary Research
20.04.2018 | Physics and Astronomy