51 Pages Posted: 6 Feb 2003
Date Written: November 2001
The successful integration of data from autonomous and heterogeneous systems calls for the resolution of semantic conflicts that may be present. Such conflicts are often reflected by discrepancies in attribute values of the same data object. In this paper, we describe a recently developed prototype system, DIRECT (DIscovering and REconciling ConflicTs). The system mines data value conversion rules in the process of integrating business data from multiple sources. The system architecture and functional modules are described. The process of discovering conversion rules from sales data of a trading company is presented as an illustrative example.
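To make the notion of a data value conversion rule concrete, the following minimal sketch fits a linear rule between the values that two sources record for the same objects. It is an illustration only, not the algorithm used by DIRECT, which the abstract does not specify: the ordinary least squares fit, the function name `fit_linear_rule`, the invoice identifiers, and the amounts are all hypothetical.

```python
def fit_linear_rule(xs, ys):
    """Ordinary least squares fit of ys ~= slope * xs + intercept."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need two equally sized samples with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("xs must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: the same invoices as recorded by two sources whose
# amounts disagree systematically (for example, one includes a 5% tax).
source_a = {"inv-001": 100.0, "inv-002": 250.0, "inv-003": 80.0, "inv-004": 400.0}
source_b = {"inv-001": 105.0, "inv-002": 262.5, "inv-003": 84.0, "inv-004": 420.0}

# Compare only the objects present in both sources.
shared = sorted(source_a.keys() & source_b.keys())
slope, intercept = fit_linear_rule(
    [source_a[k] for k in shared],
    [source_b[k] for k in shared],
)

# The fitted coefficients are a candidate conversion rule between the sources.
print(f"source_b.amount ~= {slope:.3f} * source_a.amount + {intercept:.3f}")
```

A rule recovered this way would still need to be checked against held-out records and outliers before being used to reconcile the two sources.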
Keywords: Data Integration, Data Mining, Semantic Conflicts, Data Visualization, Statistical Analysis, Data Value Conversion
Suggested Citation:
Fan, Weiguo and Lu, Hongjun and Madnick, Stuart and Cheung, David W., DIRECT: A System for Mining Data Value Conversion Rules from Disparate Data Sources (November 2001). MIT Sloan Working Paper No. 4411-01. Available at SSRN: https://ssrn.com/abstract=377900 or http://dx.doi.org/10.2139/ssrn.377900