Prompt Provenance: Toward Traceable LLM Interactions

Procko, Tyler; Vonder Haar, Lynn; Elvira, Timothy; Ochoa, Omar

doi:10.2139/ssrn.5682942

Download This Paper

Open PDF in Browser

Add Paper to My Library

Prompt Provenance: Toward Traceable LLM Interactions

5 Pages Posted: 14 Nov 2025

See all articles by Tyler Procko

Omar Ochoa

Embry-Riddle Aeronautical University

Date Written: October 07, 2025

Abstract

Large Language Models (LLMs) operate as black boxes, with transient inputs and outputs, i.e., there is little trace of the context, agents, or data that shapes responses. Yet every LLM interaction is a provenance event: an activity producing an artifact under the influence of agents and prior states. This paper introduces the Prompt Provenance Model (PPM), a conceptual model for representing the lineage of prompts, completions, and dialogue histories using the PROV framework of the World Wide Web Consortium (W3C). The PPM extends PROV-O to treat prompts as first-class entities, defining relations between user intent, retrieval sources, system messages, and generated artifacts. It is posited that capturing prompt-level provenance is essential for auditability, explainability, and regulatory compliance in LLM ecosystems. Example applications demonstrate its use for research reproducibility, model debugging, and forensic accountability. This paper contends that prompt provenance is foundational to the trustworthy deployment of Findable, Accessible, Interoperable, and Reusable (FAIR) generative AI.

Keywords: LLM, Prompt Engineering, Provenance, PROV-O, Fair

Suggested Citation: Suggested Citation

Procko, Tyler and Vonder Haar, Lynn and Elvira, Timothy and Ochoa, Omar, Prompt Provenance: Toward Traceable LLM Interactions (October 07, 2025). Available at SSRN: https://ssrn.com/abstract=5682942 or http://dx.doi.org/10.2139/ssrn.5682942