Abstract

Abstract  Data Mining has evolved as a new discipline at the intersection of several existing areas, including Database Systems, Machine Learning, Optimization, and Statistics. An important question is whether the field has matured to the point where it has originatedsubstantial new problems and techniques that distinguish it from its parent disciplines. In this paper, we discuss a classof new problems and techniques that show great promise for exploratory mining, while synthesizing and generalizing ideas fromthe parent disciplines. While the class of problems we discuss is broad, there is a common underlying objective—to look beyonda single data-mining step (e.g., data summarization or model construction) and address the combined process of data selectionand transformation, parameter and algorithm selection, and model construction. The fundamental difficulty lies in the largespace of alternative choices at each step, and good solutions must provide a natural framework for managing this complexity.We regard this as a grand challenge for Data Mining, and see the ideas discussed here as promising initial steps towards arigorous exploratory framework that supports the entire process.

Description

SpringerLink - Journal Article

Links and resources

Tags

community

  • @dblp
  • @jhammerb
@jhammerb's tags highlighted