Semplore: A Scalable IR Approach to Search the Web of Data

12 Pages
Posted: 9 Jul 2018
Publication Status: Accepted

Haofen Wang

Shanghai Jiao Tong University (SJTU); Gowild Robotics Co. Ltd

Qiaoling Liu

Shanghai Jiao Tong University (SJTU)

Thomas Penin

Shanghai Jiao Tong University (SJTU)

Linyun Fu

Shanghai Jiao Tong University (SJTU)

Lei Zhang

IBM China Research Lab

Thanh Tran

Karlsruhe Institute of Technology - Institute of Applied Informatics and Formal Description Methods (AIFB)

Yong Yu

Shanghai Jiao Tong University (SJTU) - Apex Data & Knowledge Management Lab

Yue Pan

IBM China Research Lab

Abstract

The Web of Data keeps growing rapidly. However, fully exploiting this large amount of structured data faces numerous challenges, such as usability, scalability, imprecise information needs, and data change. We present Semplore, an IR-based system that aims to address these issues. Semplore supports intuitive faceted search and complex queries on both text and structured data. It combines imprecise keyword search and precise structured queries in a unified ranking scheme. Scalable query processing is supported by leveraging the inverted indexes traditionally used in IR systems. This is combined with a novel block-based index structure that supports efficient index updates when data changes. The experimental results show that Semplore is an efficient and effective system for searching the Web of Data and can be used as a basic infrastructure for Web-scale Semantic Web search engines.
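To make the idea of answering keyword and structured conditions from one inverted index more concrete, the following is a minimal sketch and not Semplore's actual implementation. It assumes that structured facts (class membership, relations) can be encoded as artificial index terms next to ordinary keywords; the class `ToyEntityIndex`, the `kw:`/`class:`/`rel:` term prefixes, and the sample entities are all hypothetical. Ranking and the block-based update structure mentioned in the abstract are not modelled.

```python
from collections import defaultdict


class ToyEntityIndex:
    """Toy inverted index: each term maps to the set of entity ids containing it."""

    def __init__(self):
        self._postings = defaultdict(set)

    def add_entity(self, entity_id, text, classes=(), relations=()):
        # Keyword terms come from the entity's textual description.
        for token in text.lower().split():
            self._postings["kw:" + token].add(entity_id)
        # Assumption of this sketch: structured facts become artificial terms.
        for cls in classes:
            self._postings["class:" + cls].add(entity_id)
        for predicate, target in relations:
            self._postings["rel:%s=%s" % (predicate, target)].add(entity_id)

    def search(self, *terms):
        """Return sorted ids of entities that match every term (conjunctive query)."""
        if not terms:
            return []
        # Intersect the shortest posting sets first to keep intermediate results small.
        postings = sorted((self._postings.get(t, set()) for t in terms), key=len)
        result = set(postings[0])
        for posting in postings[1:]:
            result &= posting
        return sorted(result)

    def facet_counts(self, entity_ids, prefix="class:"):
        """Count, per facet value, how many of the given entities carry it."""
        hits = set(entity_ids)
        counts = {}
        for term, posting in self._postings.items():
            if term.startswith(prefix):
                n = len(hits & posting)
                if n:
                    counts[term[len(prefix):]] = n
        return counts


# Hypothetical sample data, for illustration only.
index = ToyEntityIndex()
index.add_entity(1, "Semantic Web search engine", classes=["Software"])
index.add_entity(2, "Web search researcher", classes=["Person"],
                 relations=[("worksAt", "UniversityA")])
index.add_entity(3, "Database researcher", classes=["Person"],
                 relations=[("worksAt", "UniversityB")])

# A keyword condition and two structured conditions answered by one intersection.
hits = index.search("kw:researcher", "class:Person", "rel:worksAt=UniversityA")
# Facet counts over a keyword-only result set.
facets = index.facet_counts(index.search("kw:search"))
```

In this sketch a faceted refinement is simply one more term added to the conjunction, which is one way to see why inverted-index machinery can serve both kinds of query.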

Keywords: Scalable Query Processing, Inverted Index, Faceted Search, Search Result Ranking, Index Update

Suggested Citation

Wang, Haofen and Liu, Qiaoling and Penin, Thomas and Fu, Linyun and Zhang, Lei and Tran, Thanh and Yu, Yong and Pan, Yue, Semplore: A Scalable IR Approach to Search the Web of Data (September 1, 2009). Available at SSRN: https://ssrn.com/abstract=3199426 or http://dx.doi.org/10.2139/ssrn.3199426

Haofen Wang (Contact Author)

Shanghai Jiao Tong University (SJTU)

KoGuan Law School
Shanghai 200030, Shanghai 200052
China

Gowild Robotics Co. Ltd

Shenzhen, 518057
China

Qiaoling Liu

Shanghai Jiao Tong University (SJTU)

KoGuan Law School
Shanghai 200030, Shanghai 200052
China

Thomas Penin

Shanghai Jiao Tong University (SJTU)

KoGuan Law School
Shanghai 200030, Shanghai 200052
China

Linyun Fu

Shanghai Jiao Tong University (SJTU)

KoGuan Law School
Shanghai 200030, Shanghai 200052
China

Lei Zhang

IBM China Research Lab

Beijing, 100094
China

Thanh Tran

Karlsruhe Institute of Technology - Institute of Applied Informatics and Formal Description Methods (AIFB)

Kaiserstraße 12
Karlsruhe, Baden Württemberg 76131
Germany

Yong Yu

Shanghai Jiao Tong University (SJTU) - Apex Data & Knowledge Management Lab

311 Yifu Building
#800 Dongchuan Rd
Shanghai, 200240
China

Yue Pan

IBM China Research Lab

Beijing, 100094
China
