1. Processing and visualizing the data in tweets
- Author
-
Robert C. Miller, Samuel Madden, David R. Karger, Osama Badar, Michael S. Bernstein, Adam Marcus, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Marcus, Adam, Michael S. Bernstein, Badar, Osama, Karger, David R., Madden, Samuel R., and Miller, Robert C.
- Subjects
World Wide Web ,Structure (mathematical logic) ,Data model ,Microblogging ,Process (engineering) ,Computer science ,Data manipulation language ,Interface (computing) ,Social media ,InformationSystems_MISCELLANEOUS ,Software ,Information Systems - Abstract
Microblogs such as Twitter provide a valuable stream of diverse user-generated data. While the data extracted from Twitter is generally timely and accurate, the process by which developers extract structured data from the tweet stream is ad-hoc and requires reimplementation of common data manipulation primitives. In this paper, we present two systems for querying and extracting structure from Twitter-embedded data. The first, TweeQL, provides a streaming SQL-like interface to the Twitter API, making common tweet processing tasks simpler. The second, TwitInfo, shows how end-users can interact with and understand aggregated data from the tweet stream, in addition to showcasing the power of the TweeQL language. Together these systems show the richness of content that can be extracted from Twitter.
- Published
- 2012
- Full Text
- View/download PDF