5 results on '"B. Thomas Adler"'
Search Results
2. Wikipedia Vandalism Detection: Combining Natural Language, Metadata, and Reputation Features
- Author
-
Andrew G. West, B. Thomas Adler, Paolo Rosso, Luca de Alfaro, and Santiago M. Mola-Velasco
- Subjects
Database Management ,Information retrieval ,Computer science ,media_common.quotation_subject ,Data Mining and Knowledge Discovery ,Constructive ,GeneralLiterature_MISCELLANEOUS ,Task (project management) ,Set (abstract data type) ,World Wide Web ,Metadata ,Online encyclopedia ,LENGUAJES Y SISTEMAS INFORMATICOS ,Natural language ,Reputation ,media_common - Abstract
Wikipedia is an online encyclopedia which anyone can edit. While most edits are constructive, about 7% are acts of vandalism. Such behavior is characterized by modifications made in bad faith; introducing spam and other inappropriate content. In this work, we present the results of an effort to integrate three of the leading approaches to Wikipedia vandalism detection: a spatio-temporal analysis of metadata (STiki), a reputation-based system (WikiTrust), and natural language processing features. The performance of the resulting joint system improves the state-of-the-art from all previous methods and establishes a new baseline for Wikipedia vandalism detection. We examine in detail the contribution of the three approaches, both for the task of discovering fresh vandalism, and for the task of locating vandalism in the complete set of Wikipedia revisions., The authors from Universitat Politècnica de València thank also the MICINN research project TEXT-ENTERPRISE 2.0 TIN2009-13391-C04-03 (Plan I+D+i). UPenn contributions were supported in part by ONR MURI N00014-07-1-0907. This research was partially supported by award 1R01GM089820-01A1 from the National Institute Of General Medical Sciences, and by ISSDM, a UCSC-LANL educational collaboration.
- Published
- 2011
3. Assigning trust to Wikipedia content
- Author
-
Ian Pye, Krishnendu Chatterjee, Luca de Alfaro, Marco Faella, B. Thomas Adler, Vishwanath Raman, B. T., Adler, K., Chatterjee, L., de Alfaro, Faella, Marco, I., Pye, and V., Raman
- Subjects
World Wide Web ,Computer science ,Reputation system ,media_common.quotation_subject ,Reliability (computer networking) ,Encyclopedia ,Key (cryptography) ,Content (Freudian dream analysis) ,Word (computer architecture) ,Reputation ,media_common - Abstract
The Wikipedia is a collaborative encyclopedia: anyone can contribute to its articles simply by clicking on an "edit" button. The open nature of the Wikipedia has been key to its success, but has also created a challenge: how can readers develop an informed opinion on its reliability? We propose a system that computes quantitative values of trust for the text in Wikipedia articles; these trust values provide an indication of text reliability.The system uses as input the revision history of each article, as well as information about the reputation of the contributing authors, as provided by a reputation system. The trust of a word in an article is computed on the basis of the reputation of the original author of the word, as well as the reputation of all authors who edited text near the word. The algorithm computes word trust values that vary smoothly across the text; the trust values can be visualized using varying text-background colors. The algorithm ensures that all changes to an article's text are reflected in the trust values, preventing surreptitious content changes.We have implemented the proposed system, and we have used it to compute and display the trust of the text of thousands of articles of the English Wikipedia. To validate our trust-computation algorithms, we show that text labeled as low-trust has a significantly higher probability of being edited in the future than text labeled as high-trust.
- Published
- 2008
4. Measuring author contributions to the Wikipedia
- Author
-
Vishwanath Raman, B. Thomas Adler, Luca de Alfaro, and Ian Pye
- Subjects
World Wide Web ,Information retrieval ,Order (exchange) ,If and only if ,Computer science ,media_common.quotation_subject ,Rank (computer programming) ,Revenue ,Quality (business) ,Context (language use) ,Outcome (game theory) ,media_common ,Zero (linguistics) - Abstract
We consider the problem of measuring user contributions to versioned, collaborative bodies of information, such as wikis. Measuring the contributions of individual authors can be used to divide revenue, to recognize merit, to award status promotions, and to choose the order of authors when citing the content. In the context of the Wikipedia, previous works on author contribution estimation have focused on two criteria: the total text created, and the total number of edits performed. We show that neither of these criteria work well: both techniques are vulnerable to manipulation, and the total-text criterion fails to reward people who polish or re-arrange the content.We consider and compare various alternative criteria that take into account the quality of a contribution, in addition to the quantity, and we analyze how the criteria differ in the way they rank authors according to their contributions. As an outcome of this study, we propose to adopt total edit longevity as a measure of author contribution. Edit longevity is resistant to simple attacks, since edits are counted towards an author's contribution only if other authors accept the contribution. Edit longevity equally rewards people who create content, and people who rearrange or polish the content. Finally, edit longevity distinguishes the people who contribute little (who have contribution close to zero) from spammers or vandals, whose contribution quickly grows negative.
- Published
- 2008
5. A content-driven reputation system for the wikipedia
- Author
-
Luca de Alfaro and B. Thomas Adler
- Subjects
World Wide Web ,Incentive ,Computer science ,Order (business) ,Reputation system ,media_common.quotation_subject ,Content (Freudian dream analysis) ,Reputation ,media_common - Abstract
We present a content-driven reputation system for Wikipedia authors. In our system, authors gain reputation when the edits they perform to Wikipedia articles are preserved by subsequent authors, and they lose reputation when their edits are rolled back or undone in short order. Thus, author reputation is computed solely on the basis of content evolution; user-to-user comments or ratings are not used. The author reputation we compute could be used to flag new contributions from low-reputation authors, or it could be used to allow only authors with high reputation to contribute to controversialor critical pages. A reputation system for the Wikipedia could also provide an incentive for high-quality contributions. We have implemented the proposed system, and we have used it to analyze the entire Italian and French Wikipedias, consisting of a total of 691, 551 pages and 5, 587, 523 revisions. Our results show that our notion of reputation has good predictive value: changes performed by low-reputation authors have a significantly larger than average probability of having poor quality, as judged by human observers, and of being later undone, as measured by our algorithms.
- Published
- 2007
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.