Unlocking Platform Data for Research

49 Pages Posted: 25 Oct 2024 Last revised: 12 Jan 2025

See all articles by Niva Elkin-Koren

Niva Elkin-Koren

Tel-Aviv University - Faculty of Law

Maayan Perel (Filmar)

Netanya Academic College

Ohad Somech

Bar-Ilan University, Faculty of Law, Law, Data Sciences, and Digital Ethics Lab

Multiple version iconThere are 2 versions of this paper

Date Written: August 1, 2024

Abstract

Digital platforms, which control unique access points to the rich data stored on their servers, have become a “living lab” of real-time information. Scientists and researchers increasingly use platform data for various purposes, such as training machine learning (ML) systems and Natural Language Processing (NLP) models, and for studying diverse fields such as medicine, humanities, and social sciences, including the influence of digital platforms on society. However, researchers increasingly encounter significant barriers, when attempting to access platform data. Although platforms typically lack proprietary rights over the data itself, they exert strong control over its use by imposing digital locks and boilerplate contractual limitations. Faced with the legal risk of potential breach-of-contract lawsuits filed by well-funded platforms, researchers may simply opt to steer clear of platform data research. This Article proposes private-law-centered solutions to overcome platform data lockout. First, researchers who access and use platform data without explicit permission should be able to contest breach-of-contract claims made against them by claiming copyright preemption. Platform data falls under copyright law, either because it is protected by copyright (such as user-generatedcontent) or because it constitutes basic "building blocks", such as users’ digital data trails, which are specifically excluded from copyright protection. When platforms robustly ban any reproduction of data, they effectively benefit from quasi-copyright protection, through private ordering, albeit compromising fundamental copyright principles, including fair use. Their contractual claims should, therefore, be preempted by copyright law. Second, courts should facilitate platform data research by narrowly interpreting boilerplate contractual bans on data access. Third, nuisance law may further support platform data research by empowering researchers to demand the removal of technological barriers that hinder access to public, nonproprietary data. Private law solutions to platform data lockout, however, do not grant researchers an affirmative right to use platform data for research. Legislative action of the type recently pursued by the European Union is required to establish such a Right to Research. This Article therefore concludes by examining regulatory approaches to platform data lockout, concluding that combining private law solutions with regulatory intervention offers the most effective means of adequately facilitating platform data research.

Suggested Citation

Elkin-Koren, Niva and Perel (Filmar), Maayan and Somech, Ohad, Unlocking Platform Data for Research (August 1, 2024). Indiana L. J. (forthcoming, 2025), Bar Ilan University Faculty of Law Research Paper No. 4914076, Available at SSRN: https://ssrn.com/abstract=4914076 or http://dx.doi.org/10.2139/ssrn.4914076

Niva Elkin-Koren (Contact Author)

Tel-Aviv University - Faculty of Law ( email )

Ramat Aviv
Tel Aviv, 6997801
Israel

Maayan Perel (Filmar)

Netanya Academic College ( email )

1 Unisversity Street
Netanya
Netanya, 31905
Israel

Ohad Somech

Bar-Ilan University, Faculty of Law, Law, Data Sciences, and Digital Ethics Lab ( email )

Faculty of Law
Ramat Gan, 52900
Israel

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
37
Abstract Views
204
PlumX Metrics