Data Basis (Base Dos Dados): Universalizing Access to High-Quality Data

22 Pages Posted: 28 Jul 2022

See all articles by Ricardo Dahis

Ricardo Dahis

Monash University; Data Basis

João Carabetta

Independent

Fernanda Scovino

Independent

Frederico Israel

Google Inc.

Diego Oliveira

Independent

Date Written: July 5, 2022

Abstract

In this paper we explain how the Data Basis platform helps decisively solve the data access problem for different types of users. We describe its core products: a powerful search engine, a freely accessible data lake featuring a unified schema and hundreds of interoperable tables, and APIs in various programming languages. We exemplify the platform’s utility with discussions of three datasets on labor markets, elections, and local public finances in Brazil. The project is extraordinarily cost-effective: dividing a measure of yearly benefits generated by a conservative estimate of yearly costs to run the organization yields a lower bound social return of 74. We conclude by laying out a roadmap to guide the organization’s future steps.

Keywords: Open data, big data, administrative data, search, data lake

JEL Classification: C8, O12

Suggested Citation

Dahis, Ricardo and Carabetta, João and Scovino, Fernanda and Israel, Frederico and Oliveira, Diego, Data Basis (Base Dos Dados): Universalizing Access to High-Quality Data (July 5, 2022). Available at SSRN: https://ssrn.com/abstract=4157813 or http://dx.doi.org/10.2139/ssrn.4157813

Ricardo Dahis (Contact Author)

Monash University ( email )

Wellington Road
Clayton, Victoria 3
Australia

Data Basis ( email )

Rio de Janeiro
Brazil

João Carabetta

Independent

Fernanda Scovino

Independent

Frederico Israel

Google Inc.

Diego Oliveira

Independent

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
53
Abstract Views
288
Rank
681,640
PlumX Metrics