Knowledge Discovery from Databases: the Nyu Project

15 Pages Posted: 15 Oct 2008

See all articles by James Clifford

James Clifford

affiliation not provided to SSRN

Vasant Dhar

New York University (NYU) - Leonard N. Stern School of Business; New York University (NYU) - Department of Information, Operations, and Management Sciences

Alexander Tuzhilin

New York University (NYU) - Leonard N. Stern School of Business; New York University (NYU) - Department of Information, Operations, and Management Sciences

Date Written: February 1995

Abstract

More and more application domains, from financial market analysis to weatherprediction, from monitoring supermarket purchases to monitoring satellite images, arebecomingly increasingly data-intensive. The result is massive databases that are growingat a rapid rate - it has been estimated that the worldâ¬Â"s electronic data almostdoubles every year. With this rate of data explosion, there is a pressing need for computersto play an increasing role in analyzing these huge data repositories which areimpossible to penetrate manually. The challenge is to ferret out the regularities in thedata that will prove to be interesting to the user.A group in the Information Systems department at the NYU Business School hasbeen working in this area for a number of years. The focus of our project is now on thediscovery of patterns from time series data. In this paper we give an overview of thekinds of databases we are "miningâ¬Â? and the kinds of temporal patterns and rules whichwe are attempting to discover. In the first phase of this research, we have developed ataxonomy of patterns as a way to organize our research agenda. We wish to share thetaxonomy with the research community in the "knowledge discovery in databases" areasince we have found it useful in classifying the universe of regularities or patterns intodistinct types, that is, patterns which differ in terms of their structure and the amount6f search effort required to find them. Although the primary focus of our project ison time series data, and the examples we will present are chosen from this arena, thetaxonomy is general enough to apply to any type of data.

Suggested Citation

Clifford, James and Dhar, Vasant and Tuzhilin, Alexander, Knowledge Discovery from Databases: the Nyu Project (February 1995). NYU Working Paper No. 2451/14220, Available at SSRN: https://ssrn.com/abstract=1284837

James Clifford

affiliation not provided to SSRN

Vasant Dhar

New York University (NYU) - Leonard N. Stern School of Business ( email )

44 West 4th Street
Suite 9-160
New York, NY NY 10012
United States

HOME PAGE: http://www.stern.nyu.edu/~vdhar

New York University (NYU) - Department of Information, Operations, and Management Sciences

44 West Fourth Street
New York, NY 10012
United States

Alexander Tuzhilin

New York University (NYU) - Leonard N. Stern School of Business ( email )

44 West 4th Street
Suite 9-160
New York, NY NY 10012
United States

New York University (NYU) - Department of Information, Operations, and Management Sciences

44 West Fourth Street
New York, NY 10012
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
38
Abstract Views
1,131
PlumX Metrics