Data Fusion Through Statistical Matching
13 Pages Posted: 5 Feb 2002
Date Written: 2002
In data mining applications, the limited availability of data is often a serious problem. For instance, elementary customer information resides in customer databases, but market survey data are available only for a subset of the customers, or even for a different sample of customers. Data fusion provides a way out by combining information from different sources into a single data set for further data mining. While a significant amount of work has been done on data fusion in the past, most of the research has been performed outside the data mining community. In this paper, we provide an overview of data fusion, introduce basic terminology and the statistical matching approach, distinguish between internal and external evaluation, and conclude with a larger case study.
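As a rough illustration of the idea behind statistical matching, the sketch below fuses a customer table with a survey table by copying survey answers from the most similar survey respondent. It is a minimal sketch under stated assumptions, not the procedure used in the paper: the terms "recipient" and "donor", the nearest-neighbour rule with Euclidean distance, and all variable names and values are hypothetical choices made for the example.

```python
import math

def fuse(recipients, donors, common, fusion):
    """Attach to each recipient the fusion variables of its nearest donor.

    Assumption: similarity is plain Euclidean distance on the variables
    that both data sets share (the `common` list).
    """
    fused = []
    for rec in recipients:
        # Distance is computed on the shared variables only.
        nearest = min(
            donors,
            key=lambda don: math.dist(
                [rec[c] for c in common], [don[c] for c in common]
            ),
        )
        row = dict(rec)  # keep the recipient's own fields
        row.update({f: nearest[f] for f in fusion})  # add the donated fields
        fused.append(row)
    return fused

# Hypothetical customer database (recipients): no survey answers available.
recipients = [{"age": 25, "income": 30}, {"age": 60, "income": 80}]

# Hypothetical market survey (donors): a different sample with an extra variable.
donors = [
    {"age": 27, "income": 32, "owns_car": 0},
    {"age": 58, "income": 75, "owns_car": 1},
]

fused = fuse(recipients, donors, common=["age", "income"], fusion=["owns_car"])
```

In practice the shared variables would usually be rescaled before distances are computed, since otherwise a variable with a large numeric range dominates the match.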
Keywords: Data Mining, Data Fusion, Leveraging of Sample Data