DIRECT: A System for Mining Data Value Conversion Rules from Disparate Data Sources
Virginia Polytechnic Institute & State University - Department of Accounting and Information Systems
National University of Singapore (NUS) - School of Computing
Massachusetts Institute of Technology (MIT) - Sloan School of Management
David W. Cheung
University of Hong Kong - Department of Computer Science and Information Systems
MIT Sloan Working Paper No. 4411-01
The successful integration of data from autonomous and heterogeneous systems calls for the resolution of semantic conflicts that may be present. Such conflicts are often reflected by discrepancies in attribute values of the same data object. In this paper, we describe a recently developed prototype system, DIRECT (DIscovering and REconciling ConflicTs). The system mines data value conversion rules in the process of integrating business data from multiple sources. The system architecture and functional modules are described. The process of discovering conversion rules from sales data of a trading company is presented as an illustrative example.
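To make the idea of a data value conversion rule concrete, the following is a minimal sketch, not the DIRECT system's actual algorithm, which the abstract does not specify. It assumes that two sources report the same numeric attribute for the same objects and that the discrepancy can be described by a linear rule, value_b ≈ slope * value_a + intercept, fitted by ordinary least squares. The record names, the values, and the helper `fit_linear_rule` are all hypothetical.

```python
from statistics import mean


def fit_linear_rule(xs, ys):
    """Fit y ~ slope * x + intercept by ordinary least squares."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Hypothetical data: the same invoices as recorded by two sources.
# Here source B is assumed to store amounts with a 7% surcharge included.
source_a = {"inv1": 100.0, "inv2": 250.0, "inv3": 40.0, "inv4": 500.0}
source_b = {"inv1": 107.0, "inv2": 267.5, "inv3": 42.8, "inv4": 535.0}

# Match records by their shared identifier before comparing values.
keys = sorted(source_a.keys() & source_b.keys())
slope, intercept = fit_linear_rule(
    [source_a[k] for k in keys],
    [source_b[k] for k in keys],
)

print(f"candidate rule: value_b = {slope:.4f} * value_a + {intercept:.4f}")
```

A slope near 1.07 with an intercept near zero would suggest a ratio-style rule between the two sources. In practice a candidate rule like this would still need to be checked against domain knowledge before being used for integration.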
Number of Pages in PDF File: 51
Keywords: Data Integration, Data Mining, Semantic Conflicts, Data Visualization, Statistical Analysis, Data Value Conversion
Working Papers Series
Date posted: February 6, 2003