header

Tailored Semantic Annotation for Semantic Search

22 Pages Posted: 10 Jul 2018 First Look: Accepted

See all articles by Rafael Berlanga

Rafael Berlanga

Jaume I University - Department of Computer Languages and Systems

Victoria Nebot

Jaume I University - Department of Computer Languages and Systems

Maria Pérez

Jaume I University - Department of Computer Languages and Systems

Abstract

This paper presents a novel method for semantic annotation and search of a target corpus using several knowledge resources (KRs). This method relies on a formal statistical framework in which KR concepts and corpus documents are homogeneously represented using statistical language models. Under this framework, we can perform all the necessary operations for an efficient and effective semantic annotation of the corpus.  Firstly, we propose a coarse tailoring of the KRs w.r.t the target corpus with the main goal of reducing the ambiguity of the annotations and their computational overhead. Then, we propose the generation of concept profiles, which allow measuring the semantic overlap of the KRs as well as performing a finer tailoring of them. Finally, we propose how to semantically represent documents and queries in terms of the KRs concepts and the statistical framework to perform semantic search. Experiments have been carried out with a corpus about web resources which includes several Life Sciences catalogues and Wikipedia pages related to web resources in general (e.g., databases, tools, services, etc). Results demonstrate that the proposed method is more effective and efficient than state-of-the-art methods relying on either context-free annotation or keyword-based search.

Keywords: Semantic Annotation, Semantic Search, Language Models

Suggested Citation

Berlanga, Rafael and Nebot, Victoria and Pérez, Maria, Tailored Semantic Annotation for Semantic Search (2015). Journal of Web Semantics First Look. Available at SSRN: https://ssrn.com/abstract=3199176 or http://dx.doi.org/10.2139/ssrn.3199176

Rafael Berlanga (Contact Author)

Jaume I University - Department of Computer Languages and Systems

Castellon
E-12071 Castello de la Plana
Spain

Victoria Nebot

Jaume I University - Department of Computer Languages and Systems ( email )

Castellon
E-12071 Castello de la Plana
Spain

Maria Pérez

Jaume I University - Department of Computer Languages and Systems

Castellon
E-12071 Castello de la Plana
Spain

Register to save articles to
your library

Register

Paper statistics

Abstract Views
118
Downloads
2