Towards Life Sciences Search with Blazing Speed
Posted: 6 Dec 2019 Last revised: 16 Dec 2019
Date Written: December 3, 2019
EMBASE (Excerpta Medica dataBASE) is a biomedical and pharmacological bibliographic system consisting of more than 37 million records from over 8,500 journals. It enables comprehensive tracking and retrieval of drug information. We show and benchmark several approaches to improve search efficiency and reliability by using advanced search techniques in the context of a transition from a NoSQL and XML database to a full-fledged search engine. This includes an overview of the scalable infrastructure topology, data modeling and optimization of the indexing and search schema, writing the optimal queries for search, and techniques to support efficient faceting and exports of large amounts of records.
Keywords: Biomedical search engine, search efficiency, life sciences, bibliographic search, search engine migration, evaluation
Suggested Citation: Suggested Citation