In this post, I want to show how I use NLTK for preprocessing and tokenization, but then apply machine learning techniques (e.g. building a linear SVM using stochastic gradient descent) using Scikit-Learn.
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
You've built a vibrant community of Family Guy enthusiasts. The SVD recommendation algorithm took your site to the next level by allowing you to leverage the implicit knowledge of your community. But now you're ready for the next iteration - you are about
W. Smart, and M. Zhang. CRPIT '04: Proceedings of the second workshop on
Australasian information security, Data Mining and Web
Intelligence, and Software Internationalisation, 32 no. 7, page 133--138. Dunedin, New Zealand, Australian Computer Society, Inc., (January 2004)
A. Almal, A. Mitra, R. Datar, P. Lenehan, D. Fry, R. Cote, and W. Worzel. GECCO 2006: Proceedings of the 8th annual conference
on Genetic and evolutionary computation, 1, page 239--246. Seattle, Washington, USA, ACM Press, (8-12 July 2006)
V. Spiliopoulos, A. Valarakos, and G. Vouros. Proceedings of the 5th European Semantic Web Conference, Berlin, Heidelberg, Springer Verlag, (June 2008)
R. Schapire, Y. Singer, and A. Singhal. Proceedings of SIGIR-98, 21st ACM International Conference on
Research and Development in Information Retrieval, page 215--223. Melbourne, Australia, ACM Press, New York, US, (1998)