|
||||
|
||||
Querying Web-Sources within a Data FederationLynn WuUniversity of Pennsylvania - The Wharton School; Massachusetts Institute of Technology (MIT) Aykut FiratMassachusetts Institute of Technology (MIT) - Sloan School of Management Tarik AlatovicMassachusetts Institute of Technology (MIT) Stuart MadnickMassachusetts Institute of Technology (MIT) - Sloan School of Management August 2006 MIT Sloan Research Paper No. 4624-06 CISL Working Paper No. 2006-09 Abstract: The web is undoubtedly the largest and most diverse repository of data, but it was not designed to offer the capabilities of traditional data base management systems - which is unfortunate. In a true data federation, all types of data sources, such as relational databases and semi-structured websites, could be used together. IBM WebSphere uses the "request-reply-compensate" protocol to communicate with wrappers in a data federation. This protocol expects wrappers to reply to query requests by indicating the portion of the queries they can answer. While this provides a very generic approach to data federation, it also requires the wrapper developer to deal with some of the complexities of capability considerations through custom coding. Alternative approaches based on declarative capability restrictions have been proposed in the literature, but they have not found their way into commercial systems, perhaps due to their complexity. We offer a practical middle-ground solution to querying web-sources, using IBM's data federation system as an example. In lieu of a two-layered architecture consisting of wrapper and source layers, we propose to move the capability declaration from the wrapper layer to a single component between the wrapper and the native data source. The advantage of this three-layered architecture is that each new web-source only needs to register its capability with the capability-declaration component once, which saves the work of writing a new wrapper each time. Thus the inclusion of web-sources through this mechanism can be accelerated in a way that doesn't require a change in existing data federation technology.
Number of Pages in PDF File: 19 Keywords: federated data, web data sources, capabilities, query handling working papers seriesDate posted: August 28, 2006Suggested CitationContact Information
|
|
||||||||||||||||||||
© 2013 Social Science Electronic Publishing, Inc. All Rights Reserved.
FAQ
Terms of Use
Privacy Policy
Copyright
This page was processed by apollo1 in 2.485 seconds