Public Utility for What?: Governing AI Datastructures

Yale Journal of Law and Technology, vol. 27 (2025).

33 Pages Posted: 27 May 2025 Last revised: 21 Nov 2025

See all articles by Julie E. Cohen

Julie E. Cohen

Georgetown University Law Center

Date Written: May 04, 2025

Abstract

Both in the U.S. and in Europe, initiatives for AI governance have focused principally on identifying and mitigating the risks created by AI models and their downstream uses rather than on those created by the datasets on which the models are trained. As this paper will explain, some of the most intractable dysfunctions of generative AI systems involve datasets. In particular, the very large datasets amassed by dominant providers of generative AI and related services are rapidly taking on infrastructural characteristics and importance. Effective AI governance therefore requires an infrastructural turn in thinking about data. 

First, the paper explains the significance of the infrastructure lens and sketches some of the distinctive implications of data infrastructures, in particular, for governance of networked digital processes and the social and economic activities that they facilitate. Next, it explores two interrelated problems manifesting within generative AI systems-simulation and sociopathy-that illustrate the extent to which the project of AI governance is, unavoidably, a data governance project. In brief, generative AI models trained on content from the public internet are also trained on data infrastructures that have been developed in particular ways for particular purposes and that encourage the production and spread of particular kinds of content. Last, the paper considers whether the concept of public utility, now the subject of growing interest among legal scholars who study various regulated industries, might supply a possible foundation for tackling the data governance problems associated with generative AI systems. The public utility model, however, addresses only some of the considerations that the infrastructure lens highlights. It is highly attuned to questions about access to infrastructures and their outputs but relatively insensitive to questions about infrastructure configuration and input sourcing. The problems of simulation and sociopathy belong in the latter category.

Suggested Citation

Cohen, Julie E., Public Utility for What?: Governing AI Datastructures (May 04, 2025). Yale Journal of Law and Technology, vol. 27 (2025)., Available at SSRN: https://ssrn.com/abstract=5266513 or http://dx.doi.org/10.2139/ssrn.5266513

Julie E. Cohen (Contact Author)

Georgetown University Law Center ( email )

600 New Jersey Avenue, NW
Washington, DC 20001
United States
202-662-9871 (Phone)
202-662-9411 (Fax)

HOME PAGE: http://www.law.georgetown.edu/faculty/jec/

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
559
Abstract Views
2,418
Rank
127,842
PlumX Metrics