Following up on KMeans Clustering Now Running on Elastic MapReduce, Stephen Green has generously documented the steps that was necessary to get an example of k-Means clustering up and running on Amazon’s Elastic MapReduce (EMR) on the Apache Lucene Mahout wiki.
D. Arthur, und S. Vassilvitskii. SODA '07: Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, Seite 1027--1035. Philadelphia, PA, USA, Society for Industrial and Applied Mathematics, (2007)
G. Hamerly, und C. Elkan. CIKM '02: Proceedings of the eleventh international conference on Information and knowledge management, Seite 600--607. New York, NY, USA, ACM, (2002)
J. MacQueen. Proc. of the fifth Berkeley Symposium on Mathematical Statistics and Probability, 1, Seite 281-297. University of California Press, (1967)
D. Cutting, D. Karger, J. Pedersen, und J. Tukey. SIGIR '92: Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval, Seite 318--329. New York, NY, USA, ACM Press, (1992)
A. Hotho, A. Maedche, und S. Staab. ICDM '01: Proceedings of the 2001 IEEE International Conference on Data Mining, Seite 607--608. Washington, DC, USA, IEEE Computer Society, (2001)
A. Hotho, A. Maedche, und S. Staab. ICDM '01: Proceedings of the 2001 IEEE International Conference on Data Mining, Seite 607--608. Washington, DC, USA, IEEE Computer Society, (2001)
A. Phansalkar, A. Joshi, L. Eeckhout, und L. John. IEEE International Symposium on Performance Analysis of Systems and Software, 2005. ISPASS 2005., Seite 10--20. (März 2005)
S. Basu, A. Banerjee, und R. Mooney. Proceedings of the 2004 SIAM International Conference on Data Mining, Seite 333--344. Lake Buena Vista, FL, Society for Industrial and Applied Mathematics, (April 2004)