Turning Fake Data into Fake News: The A.I. Training Set as a Trojan Horse of Misinformation

San Diego Law Journal, Forthcoming

34 Pages; Posted: 25 Jul 2023

Bill Tomlinson

University of California, Irvine; Victoria University of Wellington - Te Herenga Waka

Donald Patterson

Westmont College; University of California, Irvine

Andrew W. Torrance

University of Kansas School of Law; MIT Sloan School of Management

Date Written: July 19, 2023

Abstract

Generative artificial intelligence (“A.I.”) offers tremendous benefits to society. However, these benefits must be carefully weighed against the societal damage A.I. can also cause. Dangers posed by inaccurate training sets have been raised by many authors. These include racial discrimination, sexual bias, and other pernicious forms of misinformation. One remedy to such problems is to ensure that training sets used to teach A.I. models are correct and that the data upon which they rely are accurate. An assumption behind this correction is that data inaccuracies are inadvertent mistakes. However, a darker possibility exists: the deliberate seeding of training sets with inaccurate information for the purpose of skewing the output of A.I. models toward misinformation. As United States Supreme Court Justice Oliver Wendell Holmes, Jr., suggested, laws are not written for the “good man”, because good people will tend to obey moral and legal principles in manners consistent with a well-functioning society even in the absence of formal laws. Rather, Holmes proposed, laws should be written with the “bad man” in mind, because bad people will push the limits of acceptable behavior, engaging in cheating, dishonesty, crime, and other societally damaging practices, unless constrained by carefully designed laws and their accompanying penalties.

This article raises the spectre of the deliberate sabotage of training sets used to train A.I. models, with the purpose of perverting the outputs of such models. Examples include fostering revisionist histories, unjustly harming or rehabilitating the reputations of people, companies, or institutions, or even promoting as true ideas that are not. Strategic and clever efforts to introduce ideas into training sets that later manifest themselves as facts could aid and abet fraud, libel, slander, or the creation of “truths,” the belief in which promotes the interests of particular individuals or groups. Imagine, for example, a first investor who buys grapefruit futures, who then seeds training sets with the idea that grapefruits will become the new gold, with the result that later prospective investors who consult A.I. models for investment advice are informed that they should invest in grapefruit, enriching the first investor. Or, consider a malevolent political movement that hopes to rehabilitate the reputation of an abhorrent leader; if done effectively, this movement could seed training sets with sympathetic information about this leader, resulting in positive portrayals of this leader in the future outputs of trained A.I. models.

This article adopts the cautious attitude necessitated by Holmes’ Bad Man, applying it to proactively stopping, or retroactively punishing and correcting, deliberate attempts to subvert the training sets of A.I. models. It offers legal approaches drawn from doctrines ranging from fraud, nuisance, libel, and slander, to misappropriation, privacy, and right of publicity. It balances these with protections for speech afforded by the First Amendment and other doctrines of free speech. The result is the first comprehensive attempt to prevent, respond to, and correct deliberate attempts to subvert training sets of A.I. models for malicious purposes.

Keywords: Artificial intelligence, AI, fake news, training set, misinformation

Suggested Citation

Tomlinson, Bill and Patterson, Donald and Torrance, Andrew W., Turning Fake Data into Fake News: The A.I. Training Set as a Trojan Horse of Misinformation (July 19, 2023). San Diego Law Journal, Forthcoming, Available at SSRN: https://ssrn.com/abstract=4515571

Bill Tomlinson (Contact Author)

University of California, Irvine

Bren Hall
Irvine, CA 92697-3440
United States

Victoria University of Wellington - Te Herenga Waka

P.O. Box 600
Wellington, 6140
New Zealand

Donald Patterson

Westmont College

United States
8055657028 (Phone)

HOME PAGE: http://www.djp3.net

University of California, Irvine

Campus Drive
Irvine, CA 92697-3125
United States

Andrew W. Torrance

University of Kansas School of Law

Green Hall
1535 W. 15th Street
Lawrence, KS 66045-7577
United States

MIT Sloan School of Management

100 Main Street
Cambridge, MA 02142
United States
