Co-clustering Software (Version 1.1)
Announcement
The program
Co-cluster (Version 1.1) is a new version of
C++ program based on the cocluster (Version 1.0) written by
Hyuk Cho, Yuqiang Guan and Suvrit Sra, which
implements three
co-clustering algorithms: information-theoretic co-clustering algorithm
and two types of minimum sum-squared residue co-clustering algorithms
(see the papers for details). In our
implementation, all the algorithms have the ping-pong structure, i.e.,
a batch algorithm followed by corresponding chain of first variations.
Each algorithm also has five variations, based on in what order to
update the row or column centroids.
Download
Citation
You are welcome to use the code under the terms of the GNU Public License (GPL), however please acknowledge its use with a citation:
- Co-clustering of Human Cancer Microarrays using Minimum Sum-Squared Residue Co-clustering,
H. Cho and I.S. Dhillon,
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB), vol. 5:3, pages 385-400, July 2008.
Download: [pdf]
- Minimum Sum-Squared Residue Co-clustering of Gene Expression Data,
H. Cho, I.S. Dhillon, Y. Guan and S. Sra,
Proceedings of The fourth SIAM International Conference on Data Mining, pages 114-125, April 2004.
Download: [ps,
pdf]
- Information Theoretic Clustering of Sparse Co-Occurrence Data
I.S. Dhillon and Y. Guan,
Proceedings of The Third IEEE International Conference on Data Mining, pages 517-520, November 2003.
Download: [ps,
pdf]
(A longer version appears as UTCS Technical Report #TR-03-39, September 2003.
[Abstract & Download])
(Also, appears as "Clustering Large and Sparse Co-Occurrence Data", Workshop on Clustering High-Dimensional Data and its Applications
at The Third SIAM International Conference on Data Mining, May 2003.
Download: [ps,
pdf])
- Information-Theoretic Co-clustering,
I. S. Dhillon, S. Mallela, and D. S. Modha,
Proceedings of The Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining(KDD), pages 89-98, August 2003.
Download: [ps,
pdf]
(Also, appears as UTCS Technical Report #TR-03-12, April 2003.
[Abstract & Download])