In this post, I want to show how I use NLTK for preprocessing and tokenization, and then apply machine learning techniques (e.g., training a linear SVM with stochastic gradient descent) using scikit-learn.
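A minimal sketch of that workflow might look like the following. It assumes NLTK's `wordpunct_tokenize` and `PorterStemmer` for preprocessing (both work without downloading extra NLTK data), a TF-IDF representation, and `SGDClassifier` with hinge loss as the linear SVM trained by stochastic gradient descent. The helper name `nltk_tokenize`, the toy corpus, and the labels are invented purely for illustration.

```python
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import SGDClassifier
from sklearn.pipeline import make_pipeline

stemmer = PorterStemmer()


def nltk_tokenize(text):
    """Lowercase, split with NLTK, keep alphabetic tokens, and stem them."""
    tokens = wordpunct_tokenize(text.lower())
    return [stemmer.stem(tok) for tok in tokens if tok.isalpha()]


# Toy corpus and labels, made up for this example only.
texts = [
    "The striker scored two goals in the final match",
    "The team won the league after a tense game",
    "The goalkeeper saved a penalty in extra time",
    "The new processor doubles the laptop battery life",
    "The software update fixes a bug in the compiler",
    "Developers released a faster database engine",
]
labels = ["sports", "sports", "sports", "tech", "tech", "tech"]

# NLTK handles tokenization; scikit-learn handles features and learning.
# loss="hinge" makes SGDClassifier a linear SVM trained with SGD.
model = make_pipeline(
    TfidfVectorizer(tokenizer=nltk_tokenize, lowercase=False, token_pattern=None),
    SGDClassifier(loss="hinge", max_iter=1000, tol=1e-3, random_state=0),
)
model.fit(texts, labels)

predictions = model.predict(
    ["The match ended with a late goal", "A compiler bug crashed the software"]
)
print(predictions)
```

Passing the NLTK function as the vectorizer's `tokenizer` keeps preprocessing inside the pipeline, so the same steps are applied at training and prediction time.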
In this post you will see five recipes for supervised classification algorithms applied to the small standard datasets that ship with the scikit-learn library.
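The five recipes are not listed in this excerpt, so the following is only a sketch of the general pattern such a recipe tends to follow: load a bundled dataset, split it, fit a classifier, and score it. The choice of the iris dataset and of logistic regression is an assumption made for illustration, not necessarily one of the recipes.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Load a small standard dataset that ships with scikit-learn.
X, y = load_iris(return_X_y=True)

# Hold out 30% of the samples for evaluation, preserving class balance.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0, stratify=y
)

# Fit the classifier (illustrative choice) and score it on the held-out data.
clf = LogisticRegression(max_iter=1000)
clf.fit(X_train, y_train)

accuracy = accuracy_score(y_test, clf.predict(X_test))
print(f"held-out accuracy: {accuracy:.2f}")
```

Swapping in a different estimator, such as `GaussianNB` or `DecisionTreeClassifier`, only changes the line that constructs `clf`; the rest of the pattern stays the same.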
A. Dargahi Nobari, N. Reshadatmand, and M. Neshati. Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 2035–2038. New York, NY, USA, Association for Computing Machinery, (2017)
S. Wang, J. Tang, C. Aggarwal, and H. Liu. Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, ACM, (October 2016)
L. Hettinger, A. Zehe, A. Dallmann, and A. Hotho. INFORMATIK 2019: 50 Jahre Gesellschaft für Informatik – Informatik für Gesellschaft, pp. 191–204. Bonn, Gesellschaft für Informatik e.V., (2019)
L. Wang, Z. Cao, G. de Melo, and Z. Liu. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 1, pp. 1298–1307. (2016)
R. Girju, P. Nakov, V. Nastase, S. Szpakowicz, P. Turney, and D. Yuret. Proceedings of the 4th International Workshop on Semantic Evaluations, pp. 13–18. Stroudsburg, PA, USA, Association for Computational Linguistics, (2007)
J. Rotsztejn, N. Hollenstein, and C. Zhang. (2018). arXiv:1804.02042. Accepted to SemEval 2018 (12th International Workshop on Semantic Evaluation).
D. Tang, F. Wei, N. Yang, M. Zhou, T. Liu, and B. Qin. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1555–1565. Baltimore, Maryland, Association for Computational Linguistics, (June 2014)
B. Rink and S. Harabagiu. Proceedings of the 5th International Workshop on Semantic Evaluation, pp. 256–259. Association for Computational Linguistics, (2010)
K. Xu, Y. Feng, S. Huang, and D. Zhao. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 536–540. (2015). arXiv:1506.07650.
D. Ramage, D. Hall, R. Nallapati, and C. Manning. Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1, pp. 248–256. Stroudsburg, PA, USA, Association for Computational Linguistics, (2009)