A Big Data Analytics Framework for Scientific Data Management
Posted: 5 Sep 2014
Date Written: December 2013
Abstract
The Ophidia project is a research effort addressing the requirements, issues, and challenges of big data analytics for eScience. We present the Ophidia analytics framework, which is responsible for atomically processing, transforming, and manipulating array-based data. The framework provides a common way to run analytics tasks on big datasets across large clusters. This paper highlights the design principles, algorithm, and most relevant implementation aspects of the framework. Some experimental results for a couple of data analytics operators, obtained in a real cluster environment, are also presented.
Keywords: big data, data analytics, parallel I/O, eScience