UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
A Comparison of Inference Techniques for Semi-supervised Clustering with Hidden Markov Random Fields (2004)
Mikhail Bilenko
and
Sugato Basu
Recently, a number of methods have been proposed for semi-supervised clustering that employ supervision in the form of pairwise constraints. We describe a probabilistic model for semi-supervised clustering based on Hidden Markov Random Fields (HMRFs) that incorporates relational supervision. The model leads to an EM-style clustering algorithm, the E-step of which requires collective assignment of instances to cluster centroids under the constraints. We evaluate three known techniques for such collective assignment: belief propagation, linear programming relaxation, and iterated conditional modes (ICM). The first two methods attempt to globally approximate the optimal assignment, while ICM is a greedy method. Experimental results indicate that global methods outperform the greedy approach when relational supervision is limited, while their benefits diminish as more pairwise constraints are provided.
View:
PDF
,
PS
Citation:
In
Proceedings of the ICML-2004 Workshop on Statistical Relational Learning and its Connections to Other Fields (SRL-2004)
, Banff, Canada, July 2004.
Bibtex:
@InProceedings{Bilenko:SRL-2004, title={A Comparison of Inference Techniques for Semi-supervised Clustering with Hidden Markov Random Fields}, author={Mikhail Bilenko and Sugato Basu}, booktitle={Proceedings of the ICML-2004 Workshop on Statistical Relational Learning and its Connections to Other Fields (SRL-2004)}, month={July}, address={Banff, Canada}, url="http://www.cs.utexas.edu/users/ai-lab?Bilenko:SRL-2004", year={2004} }
People
Sugato Basu
Ph.D. Alumni
sugato [at] cs utexas edu
Mikhail Bilenko
Ph.D. Alumni
mbilenko [at] microsoft com
Areas of Interest
Machine Learning
Semi-Supervised Learning
Statistical Relational Learning
Labs
Machine Learning