Content & Structure Coverage: Extracting a Diverse Information Subset

INFORMS Journal on Computing, 2017, 29(4): 660-675

Posted: 17 Oct 2014 Last revised: 13 Aug 2017

See all articles by Baojun Ma

Baojun Ma

Beijing University of Posts and Telecommunication - School of Economics and Management

Qiang Wei

Tsinghua University - School of Economics & Management

Guoqing Chen

Tsinghua University - School of Economics & Management

Jin Zhang

Renmin University of China - School of Business

Xunhua Guo

Tsinghua University - School of Economics & Management

Date Written: August 11, 2017

Abstract

Recent years have witnessed a rapid increase of online data volume and the growing challenge of information overload for Web use and applications, thus information diversity is of great importance to both providers and users of search services. Based on the proposed diversity evaluation measure (namely, information coverage), two heuristic methods with corresponding algorithms are designed upon the greedy submodular idea. First, this paper proposes the CovC S-Select algorithm, which possesses the characteristic of asymptotic optimality, to optimize information coverage by applying the strategy of simulated annealing. Then to accelerate the speed of CovC S-Select algorithm, its fast approximation called FastCovC S-Select has been introduced by utilizing the properties of information coverage. Furthermore, ample experiments have been conducted to show the effectiveness, efficiency and parameter robustness of proposed diversity-oriented extraction methods in this paper, along with comparative analyses revealing their performances advantageous over other related methods.

Keywords: Information diversity, Information content coverage, Information structure coverage, Submodularity, Fast approximation

Suggested Citation

Ma, Baojun and Wei, Qiang and Chen, Guoqing and Zhang, Jin and Guo, Xunhua, Content & Structure Coverage: Extracting a Diverse Information Subset (August 11, 2017). INFORMS Journal on Computing, 2017, 29(4): 660-675. Available at SSRN: https://ssrn.com/abstract=2510350 or http://dx.doi.org/10.2139/ssrn.2510350

Baojun Ma

Beijing University of Posts and Telecommunication - School of Economics and Management ( email )

Mailbox 164, School of Economics and Management
No. 10 Xitucheng Road, Haidian District
Beijing, Beijing 100876
China

HOME PAGE: http://www.baojunma.com

Qiang Wei (Contact Author)

Tsinghua University - School of Economics & Management ( email )

Beijing, 100084
China
+86-10-62789824 (Phone)
+86-10-62771647 (Fax)

HOME PAGE: http://www.sem.tsinghua.edu.cn/en/weiq

Guoqing Chen

Tsinghua University - School of Economics & Management ( email )

Beijing, 100084
China
+86-10-62789925 (Phone)
+86-10-62789925 (Fax)

HOME PAGE: http://www.sem.tsinghua.edu.cn/en/chengq

Jin Zhang

Renmin University of China - School of Business ( email )

Beijing
China

HOME PAGE: http://en.rbs.ruc.edu.cn/templates/T_eng_new_list2/index3.aspx?nodeid=222&page=ContentPage&contentid

Xunhua Guo

Tsinghua University - School of Economics & Management ( email )

Beijing, 100084
China

Register to save articles to
your library

Register

Paper statistics

Abstract Views
373
PlumX