Data Basis (Base Dos Dados): Universalizing Access to High-Quality Data
22 Pages Posted: 28 Jul 2022
Date Written: July 5, 2022
Abstract
In this paper we explain how the Data Basis platform helps decisively solve the data access problem for different types of users. We describe its core products: a powerful search engine, a freely accessible data lake featuring a unified schema and hundreds of interoperable tables, and APIs in various programming languages. We exemplify the platform’s utility with discussions of three datasets on labor markets, elections, and local public finances in Brazil. The project is extraordinarily cost-effective: dividing a measure of yearly benefits generated by a conservative estimate of yearly costs to run the organization yields a lower bound social return of 74. We conclude by laying out a roadmap to guide the organization’s future steps.
Keywords: Open data, big data, administrative data, search, data lake
JEL Classification: C8, O12
Suggested Citation: Suggested Citation