Intelligent Assistance for the Data Mining Process: an Ontology-Based Approach

41 pages. Posted: 13 Oct 2008

Abraham Bernstein

University of Zurich - Dynamic and Distributed Information Systems Group

Shawndra Hill

Microsoft Research

Foster Provost

New York University

There are 2 versions of this paper

Date Written: 2002

Abstract

A data mining (DM) process involves multiple stages. A simple, but typical, process might include preprocessing data, applying a data-mining algorithm, and postprocessing the mining results. There are many possible choices for each stage, and only some combinations are valid. Because of the large space and non-trivial interactions, both novices and data-mining specialists need assistance in composing and selecting DM processes. We present the concept of Intelligent Discovery Assistants (IDAs), which provide users with (i) systematic enumerations of valid DM processes, in order that important, potentially fruitful options are not overlooked, and (ii) effective rankings of these valid processes by different criteria, to facilitate the choice of DM processes to execute. We use a prototype to show that an IDA can indeed provide useful enumerations and effective rankings. We discuss how an IDA is an important tool for knowledge sharing among a team of data miners. Finally, we illustrate all the claims with a comprehensive demonstration using a more involved process and data from the 1998 KDDCUP competition.
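
The two services described in the abstract, enumerating the valid processes and then ranking them, can be illustrated with a minimal Python sketch. It is not the authors' prototype or ontology: the operator names, the `requires`/`removes`/`adds` property sets, and the `speed` and `accuracy` scores are all hypothetical values invented for illustration. The sketch assumes each stage (preprocessing, mining, postprocessing) offers a few operators, that an operator is applicable only when the current data state satisfies its preconditions, and that a process is ranked by summing a per-operator score.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Operator:
    """One step of a DM process. All fields are illustrative assumptions."""
    name: str
    stage: str                         # "pre", "mine", or "post"
    requires: frozenset = frozenset()  # properties needed before the step
    removes: frozenset = frozenset()   # properties the step invalidates
    adds: frozenset = frozenset()      # properties the step establishes
    speed: int = 0                     # invented heuristic, higher is faster
    accuracy: int = 0                  # invented heuristic, higher is better


# A tiny, hypothetical operator catalogue (not taken from the paper).
OPERATORS = [
    Operator("skip_pre", "pre", speed=3),
    Operator("discretize", "pre", requires=frozenset({"numeric"}),
             removes=frozenset({"numeric"}), adds=frozenset({"categorical"}),
             speed=1),
    Operator("naive_bayes", "mine", requires=frozenset({"categorical"}),
             adds=frozenset({"model"}), speed=3, accuracy=2),
    Operator("decision_tree", "mine", adds=frozenset({"model", "tree"}),
             speed=2, accuracy=3),
    Operator("skip_post", "post", speed=3),
    Operator("prune_tree", "post", requires=frozenset({"tree"}),
             speed=1, accuracy=1),
]

STAGES = ["pre", "mine", "post"]


def enumerate_valid(initial_state):
    """Return every pre/mine/post combination whose preconditions hold."""
    by_stage = [[op for op in OPERATORS if op.stage == s] for s in STAGES]
    valid = []
    for combo in product(*by_stage):
        state = frozenset(initial_state)
        for op in combo:
            if not op.requires <= state:
                break  # precondition violated, so this combination is invalid
            state = (state - op.removes) | op.adds
        else:
            valid.append(combo)
    return valid


def rank(processes, criterion):
    """Order processes by the summed per-operator score, best first."""
    return sorted(processes,
                  key=lambda p: sum(getattr(op, criterion) for op in p),
                  reverse=True)


if __name__ == "__main__":
    processes = enumerate_valid({"numeric"})
    for criterion in ("speed", "accuracy"):
        print(f"Ranked by {criterion}:")
        for p in rank(processes, criterion):
            print("  " + " -> ".join(op.name for op in p))
```

Starting from purely numeric data, the sketch rejects any process that applies `naive_bayes` without first discretizing, and any that prunes a tree that was never built, which mirrors the abstract's point that only some combinations of choices are valid. Summing per-operator scores is a deliberately simple stand-in for a ranking criterion.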

Keywords: Data mining, data-mining process, intelligent assistants, knowledge discovery

Suggested Citation

Bernstein, Abraham and Hill, Shawndra and Provost, Foster, Intelligent Assistance for the Data Mining Process: an Ontology-Based Approach (2002). NYU Working Paper No. 2451/14146, Available at SSRN: https://ssrn.com/abstract=1282992

Abraham Bernstein (Contact Author)

University of Zurich - Dynamic and Distributed Information Systems Group

Plattenstrasse 14
Zurich
Switzerland

Shawndra Hill

Microsoft Research

New York, NY 10011
United States

Foster Provost

New York University

44 West Fourth Street
New York, NY 10012
United States
