UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
Spherical Topic Models (2010)
Joseph Reisinger
,
Austin Waters
,
Bryan Silverthorn
, and
Raymond J. Mooney
We introduce the Spherical Admixture Model (SAM), a Bayesian topic model for arbitrary L2 normalized data. SAM maintains the same hierarchical structure as Latent Dirichlet Allocation (LDA), but models documents as points on a high-dimensional spherical manifold, allowing a natural likelihood parameterization in terms of cosine distance. Furthermore, SAM can model word absence/presence at the document level, and unlike previous models can assign explicit negative weight to topic terms. Performance is evaluated empirically, both through human ratings of topic quality and through diverse classification tasks from natural language processing and computer vision. In these experiments, SAM consistently outperforms existing models.
View:
PDF
Citation:
In
Proceedings of the 27th International Conference on Machine Learning (ICML 2010)
2010.
Bibtex:
@inproceedings{reisinger:icml10, title={Spherical Topic Models}, author={Joseph Reisinger and Austin Waters and Bryan Silverthorn and Raymond J. Mooney}, booktitle={Proceedings of the 27th International Conference on Machine Learning (ICML 2010)}, url="http://www.cs.utexas.edu/users/ai-lab?reisinger:icml10", year={2010} }
Presentation:
Slides (PDF)
People
Raymond J. Mooney
Faculty
mooney [at] cs utexas edu
Joseph Reisinger
Ph.D. Alumni
joeraii [at] cs utexas edu
Joseph Reisinger
Formerly affiliated Ph.D. Student
joeraii [at] cs utexas edu
Bryan Silverthorn
Ph.D. Alumni
bsilvert [at] cs utexas edu
Austin Waters
Ph.D. Alumni
austin [at] cs utexas edu
Areas of Interest
Text Categorization and Clustering
Text Data Mining
Unsupervised Learning, Clustering, and Self-Organization
Labs
Machine Learning
Neural Networks