Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented the steps that was necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR) on the Apache Lucene Mahout wiki.
D. Arthur, and S. Vassilvitskii. SODA '07: Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, page 1027--1035. Philadelphia, PA, USA, Society for Industrial and Applied Mathematics, (2007)
D. Arthur, and S. Vassilvitskii. SODA '07: Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, page 1027--1035. Philadelphia, PA, USA, Society for Industrial and Applied Mathematics, (2007)
I. Yoo, and X. Hu. JCDL '06: Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries, page 220--229. New York, NY, USA, ACM Press, (2006)
A. Phansalkar, A. Joshi, L. Eeckhout, and L. John. IEEE International Symposium on Performance Analysis of Systems and Software, 2005. ISPASS 2005., page 10--20. (March 2005)
S. Basu, A. Banerjee, and R. Mooney. Proceedings of the 2004 SIAM International Conference on Data Mining, page 333--344. Lake Buena Vista, FL, Society for Industrial and Applied Mathematics, (April 2004)