Data mining Twitter “tweets” may produce a gold mine for two University of Cincinnati computer science students.
William Clifton and Alex Padgett have developed a web-based application called The Tweetographer that allows users to learn about events in their cities or neighborhoods. The app works by collecting tweets sent by large numbers of Twitter users and extracting information about events – parties, concerts, games, etc. – happening nearby. It’s like a real-time events guide.
The Tweetographer was the senior project for the pair, who are graduating during the 2011-12 academic year, Padgett in December and Clifton in June.
“We wanted to explore data mining, which is an important area of research in Computer Science, in the context of social media,” Padgett said. “Although the concept will work with many social media platforms, Twitter was the most accessible. Everything is out there in public domain, a giant pool of untapped data, tagged with latitude and longitude. It’s very precise and lends itself to so many uses.”
That broad utility created some difficulty for the developers as they tried to formulate a focused project.
“We realized that we could do all sorts of things with this data. We could add all sorts of functions, but we worked really hard to avoid ‘feature creep’ and decided to focus on events,” Clifton said.
The Tweetographer, in practice, answers a common question for socially active people: “What’s happening?” Since people who use Twitter often tweet about where they are going or what they want to do, The Tweetographer answers that question by listening in to the chatter. A user can get a sense of not only what is going on, but how popular various events are.
The application is so effective that it was initially overwhelmed by the volume of data streaming in through millions of tweets in some large cities.
“Eventually we were able to come up with a solution for this with a kind of queuing system that let us handle a stream of that magnitude,” Clifton said.
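The article does not describe how the developers' queuing system was built. As a rough illustration of the general idea only, the Python sketch below places a bounded queue between the incoming stream and a few worker threads, so a burst of tweets waits in the buffer instead of overwhelming the processing step. The function name `process_stream` and its parameters are hypothetical, not taken from The Tweetographer.

```python
import queue
import threading


def process_stream(tweets, handle, max_buffered=1000, workers=2):
    """Pass tweets through a bounded queue to a small pool of worker threads.

    Hypothetical sketch: `handle` is any function applied to each tweet.
    """
    buffer = queue.Queue(maxsize=max_buffered)
    results = []
    lock = threading.Lock()

    def worker():
        while True:
            tweet = buffer.get()
            if tweet is None:  # sentinel: no more work for this worker
                return
            processed = handle(tweet)
            with lock:
                results.append(processed)

    threads = [threading.Thread(target=worker) for _ in range(workers)]
    for thread in threads:
        thread.start()

    for tweet in tweets:
        buffer.put(tweet)  # blocks while the buffer is full (back-pressure)

    for _ in threads:
        buffer.put(None)  # one sentinel per worker
    for thread in threads:
        thread.join()
    return results


if __name__ == "__main__":
    sample = ["concert tonight", "game at 7", "party on main st"]
    print(sorted(process_stream(sample, str.upper)))
```

Because the queue is bounded, the producer slows down when the workers fall behind, which keeps memory use predictable. Results may arrive in any order, since several workers run at once.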
Another obstacle was making sense of all the available data. Although Twitter offers upwards of 140 million tweets a day, they are not posted in a uniform format.
“So many people type in their own shorthand,” Padgett said.
The solution, according to Clifton, was to create a “thesaurus” of multiple Twitter synonyms.
“Do you know how many ways people type ‘Tuesday’?” Clifton said.
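One way to picture such a "thesaurus" is a lookup table that maps each shorthand spelling to a single canonical word. The Python sketch below is an assumption-laden illustration, not the developers' code: the spellings listed, the table `DAY_SYNONYMS`, and the function `find_days` are all invented for the example.

```python
import re

# Hypothetical synonym table: canonical day -> spellings seen in tweets.
DAY_SYNONYMS = {
    "tuesday": ["tuesday", "tues", "tue", "tuez", "2sday"],
    "friday": ["friday", "fri", "fryday"],
}

# Reverse the table so each variant points at its canonical form.
CANONICAL = {
    variant: day
    for day, variants in DAY_SYNONYMS.items()
    for variant in variants
}


def find_days(tweet):
    """Return the canonical day names mentioned in a tweet, in order."""
    tokens = re.findall(r"[a-z0-9]+", tweet.lower())
    return [CANONICAL[token] for token in tokens if token in CANONICAL]


if __name__ == "__main__":
    print(find_days("Concert on TUES nite!!"))
    print(find_days("party 2sday or fri?"))
```

A plain lookup like this cannot tell an abbreviation from an ordinary word (for example "sat" in "sat down"), so a real system would likely need context as well.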
All of the technical obstacles needed to be overcome on a tight deadline – just six months from assignment to presentation.
“If we had a couple of years, we could come up with something a lot more sophisticated,” Padgett said. “Everyone is their own worst critic, and we had very high standards. We wanted to show an elegant, simple solution.”
The Tweetographer got an enthusiastic reception at its unveiling.
“It blew people’s minds,” Clifton said. “One skeptic, in particular, wanted to test us. He said, ‘If I tweet right now, it will show up,’ and we said yes. He tweeted, and it popped up onscreen right away.”
The future of The Tweetographer is yet to be written. Padgett and Clifton are making plans beyond graduation but are still actively improving and evolving the project. Clifton thinks the “engine” developed for The Tweetographer has other useful applications, such as predicting election outcomes or compiling product reviews.
“So much is out there,” Clifton said.
Greg Hand | EurekAlert!