Designing Ranking Systems for Hotels on Travel Search Engines by Mining User-Generated and Crowd-Sourced Content

60 Pages. Posted: 2 Jun 2011. Last revised: 30 Nov 2011.

Anindya Ghose

New York University (NYU) - Leonard N. Stern School of Business

Panagiotis G. Ipeirotis

New York University - Leonard N. Stern School of Business

Beibei Li

Carnegie Mellon University - H. John Heinz III School of Public Policy and Management

Date Written: June 1, 2011

Abstract

User-Generated Content (UGC) on social media platforms and product search engines is changing the way consumers shop for goods online. However, current product search engines fail to effectively leverage information created across diverse social media platforms. Moreover, current ranking algorithms in these product search engines tend to induce consumers to focus on a single product characteristic dimension (e.g., price, star rating). This approach largely ignores consumers’ multi-dimensional preferences for products. In this paper, we propose to generate a ranking system that recommends products that provide, on average, the best value for the consumer’s money. The key idea is that products that provide a higher surplus should be ranked higher on the screen in response to consumer queries. We use a unique dataset of U.S. hotel reservations made over a three-month period through Travelocity, which we supplement with data from various social media sources using techniques from text mining, image classification, social geo-tagging, human annotations, and geo-mapping. We propose a random coefficient hybrid structural model that accounts for two sources of consumer heterogeneity: the heterogeneity introduced by different travel occasions and the heterogeneity introduced by different hotel characteristics. Based on the estimates from the model, we infer the economic impact of various location and service characteristics of hotels. We then propose a new hotel ranking system based on the average utility gain a consumer receives from staying in a particular hotel. By doing so, we can provide customers with the “best-value” hotels early on. Our user studies, using ranking comparisons from several thousand users, validate the superiority of our ranking system relative to existing systems on several travel search engines. On a broader note, this paper illustrates how social media can be mined and incorporated into a demand-estimation model in order to generate a new ranking system in product search engines. We thus highlight the tight linkages between user behavior on social media and search engines. Our interdisciplinary approach provides several insights for using machine learning techniques in economics and marketing research.
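
As a rough illustration of the key idea above (rank hotels by the average surplus they give consumers rather than by a single attribute), the following is a minimal Python sketch. It is not the paper's random coefficient structural model: the two consumer types, their population shares, the taste coefficients, and the hotel attributes are all hypothetical values chosen for illustration, and utility is assumed to be linear in attributes and measured in dollars.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hotel:
    name: str
    price: float        # nightly price in dollars (hypothetical)
    star_rating: float  # 1 to 5
    near_beach: int     # 1 if near a beach, else 0


# Hypothetical consumer types: (population share, dollar value per attribute unit).
# In the paper these tastes would come from an estimated demand model;
# here they are made-up numbers.
CONSUMER_TYPES = [
    (0.6, {"star_rating": 40.0, "near_beach": 10.0}),  # e.g. business trips
    (0.4, {"star_rating": 20.0, "near_beach": 50.0}),  # e.g. leisure trips
]


def average_surplus(hotel: Hotel) -> float:
    """Share-weighted average of (dollar value of attributes minus price)."""
    total = 0.0
    for share, coef in CONSUMER_TYPES:
        value = (coef["star_rating"] * hotel.star_rating
                 + coef["near_beach"] * hotel.near_beach)
        total += share * (value - hotel.price)
    return total


def rank_by_surplus(hotels: list[Hotel]) -> list[Hotel]:
    """Return hotels sorted so that higher average surplus comes first."""
    return sorted(hotels, key=average_surplus, reverse=True)


hotels = [
    Hotel("A", 200.0, 5.0, 0),
    Hotel("B", 120.0, 3.0, 1),
    Hotel("C", 90.0, 2.0, 0),
]
ranking = [h.name for h in rank_by_surplus(hotels)]
```

In this toy data, sorting by price alone would put hotel C first and sorting by star rating alone would put hotel A first, whereas the surplus-based ordering weighs all attributes against price across both consumer types.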

Keywords: User Generated Content, Social Media, Travel Search, Structural Modeling, Hotel Demand Estimation, Text Mining, Crowd Sourcing

JEL Classification: C35, C81, D4, D12, D43, L13, L14, M31

Suggested Citation

Ghose, Anindya and Ipeirotis, Panagiotis G. and Li, Beibei, Designing Ranking Systems for Hotels on Travel Search Engines by Mining User-Generated and Crowd-Sourced Content (June 1, 2011). Available at SSRN: https://ssrn.com/abstract=1856558 or http://dx.doi.org/10.2139/ssrn.1856558

Anindya Ghose (Contact Author)

New York University (NYU) - Leonard N. Stern School of Business

44 West 4th Street
New York, NY 10012
United States

Panagiotis G. Ipeirotis

New York University - Leonard N. Stern School of Business

44 West Fourth Street
Ste 8-84
New York, NY 10012
United States
+1-212-998-0803 (Phone)

HOME PAGE: http://www.stern.nyu.edu/~panos

Beibei Li

Carnegie Mellon University - H. John Heinz III School of Public Policy and Management

Pittsburgh, PA 15213-3890
United States

Paper statistics

Downloads: 838
Rank: 21,662
Abstract Views: 4,058