At the highest level of description, this book is about data mining. However,
it focuses on data mining of very large amounts of data, that is, data so large
it does not fit in main memory. Because of the emphasis on size, many of our
examples are about the Web or data derived from the Web. Further, the book
takes an algorithmic point of view: data mining is about applying algorithms
to data, rather than using data to “train” a machine-learning engine of some
sort.
A Java-based framework for knowledge discovery and data mining algorithms supported by index structures, designed to separate data management (file parsers, database connections, data types) from algorithms (distances, distance functions, and data mining algorithms).
Mloss is a community effort at producing reproducible research via open source software, open access to data and results, and open standards for interchange.
R. Hosseini, P. Brusilovsky, M. Yudelson, and A. Hellas. Proceedings of the 25th Conference on User Modeling, Adaptation and Personalization, pages 76--84. New York, NY, USA, ACM (2017)