UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
Detecting Promotional Content in Wikipedia (2013)
Shruti Bhosale
,
Heath Vinicombe
, and
Raymond J. Mooney
This paper presents an approach for detecting promotional content in Wikipedia. By incorporating stylometric features, including features based on n-gram and PCFG language models, we demonstrate improved accuracy at identifying promotional articles, compared to using only lexical information and meta-features.
View:
PDF
Citation:
In
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013)
, pp. 1851--1857, Seattle, WA, October 2013.
Bibtex:
@inproceedings{bhosale:emnlp13, title={Detecting Promotional Content in Wikipedia}, author={Shruti Bhosale and Heath Vinicombe and Raymond J. Mooney}, booktitle={Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013)}, month={October}, address={Seattle, WA}, pages={1851--1857}, url="http://www.cs.utexas.edu/users/ai-labpub-view.php?PubID=127401", year={2013} }
Presentation:
Slides (PPT)
People
Shruti Bhosale
Formerly affiliated Masters Student
shruti [at] cs utexas edu
Raymond J. Mooney
Faculty
mooney [at] cs utexas edu
Heath Vinicombe
Formerly affiliated Masters Student
vini [at] cs utexas edu
Areas of Interest
Natural Language Processing
Text Categorization and Clustering
Labs
Machine Learning