Social Media Big Data Integration: A New Approach Based on Calibration

Forthcoming, Expert Systems with Applications (ESWA)

27 Pages Posted: 14 Sep 2017 Last revised: 22 Dec 2017

See all articles by Luciana Dalla Valle

Luciana Dalla Valle

University of Plymouth

Ron S. Kenett

KPA Ltd.; Technion-Israel Institute of Technology; Hebrew University of Jerusalem - Faculty of Medicine

Date Written: September 12, 2017

Abstract

In recent years, the growing availability of huge amounts of information, generated in every sector at high speed and in a wide variety of forms and formats, is unprecedented. The ability to harness big data is an opportunity to obtain more accurate analyses and to improve decision-making in industry, government and many other organizations. However, handling big data may be challenging and proper data integration is a key dimension in achieving high information quality. In this paper, we propose a novel approach to data integration that calibrates online generated big data with customer survey data. A common issue of customer surveys is that responses are often overly positive, making it difficult to identify areas of weaknesses in organizations. On the other hand, online reviews are often overly negative, hampering an accurate evaluation of areas of excellence. The proposed methodology calibrates the levels of unbalanced responses in different data sources via resampling and performs data integration using Bayesian Networks to propagate the new re-balanced information. In this paper we will show how, with a real data example, the novel data integration approach allows businesses and organization to correctly appraise the level of satisfaction of their customers. The real application is based on the integration of online data of review blogs and customer satisfaction surveys from the San Fransisco airport. We will illustrate that this integration enhances the information quality of the data analytic work in four of its dimensions, namely, Data Structure, Data Integration, Temporal Relevance and Chronology of Data and Goal.

Keywords: Bayesian Networks, Big Data, CCalibration, Data Integration, Social Media, Information Quality, Resampling Techniques

Suggested Citation

Dalla Valle, Luciana and Kenett, Ron S., Social Media Big Data Integration: A New Approach Based on Calibration (September 12, 2017). Forthcoming, Expert Systems with Applications (ESWA) . Available at SSRN: https://ssrn.com/abstract=3035807

Luciana Dalla Valle

University of Plymouth ( email )

Drake Circus
Plymouth, Devon PL22QZ
United Kingdom

Ron S. Kenett (Contact Author)

KPA Ltd. ( email )

Raanana
Israel
+97297408442 (Phone)
+97297408443 (Fax)

HOME PAGE: http://www.kpa-group.com

Technion-Israel Institute of Technology ( email )

Technion City
Haifa 32000, Haifa 32000
Israel

HOME PAGE: http://www.neaman.org.il/EN/Ron-Kenett

Hebrew University of Jerusalem - Faculty of Medicine ( email )

Jerusalem
Israel

Register to save articles to
your library

Register

Paper statistics

Downloads
57
Abstract Views
270
rank
371,619
PlumX Metrics