Learning Personalized Privacy Preference From Public Data

Forthcoming at Information Systems Research

55 Pages Posted: 19 Jun 2023 Last revised: 18 May 2024

See all articles by Wen Wang

Wen Wang

University of Maryland - Robert H. Smith School of Business

Beibei Li

Carnegie Mellon University - H. John Heinz III School of Public Policy and Management

Date Written: August 29, 2022

Abstract

Learning personalized consumers’ privacy preferences is crucial for firms and policymakers to establish trust and compliance and guides effective policy-making. Existing approaches rely mostly on private information such as proprietary user behavior data, individual-level demographic and socio-economic factors, or require explicit user input, which can be invasive and burdensome, potentially leading to user dissatisfaction. Nowadays, individuals generate and share vast amounts of information about themselves in the public domain, which can provide a valuable multifaceted view of their behaviors, attitudes, and preferences. This information thus has the potential to provide valuable insights into individuals’ privacy preferences. In this study, we propose a novel framework to predict personalized privacy preference by leveraging a ubiquitous source of public data - social media posts. Deeply rooted in psychological and privacy theories, we use deep learning model and natural language processing algorithms to learn theory-driven psycho-social traits such as lifestyle, risk preference, personality, privacy-related economic preferences, linguistic styles, and more from social media posts. Interestingly, we find that psycho-social traits from public data provide greater predictive power than private information. Furthermore, we conduct multiple interpretability analyses to understand what drives the model’s performance. Finally, we demonstrate the practical value of our model and show that our framework can assist platforms and policymakers in forecasting the consequences of privacy policies. Overall, our framework provides managerial implications for enhancing consumer privacy control and trust, optimizing platform data management, and informing policymakers about better data privacy regulations.

Keywords: Personalized Privacy Preference, Public Data Source, Deep Learning, Natural Language Processing, Psycho-social Traits

Suggested Citation

Wang, Wen and Li, Beibei, Learning Personalized Privacy Preference From Public Data (August 29, 2022). Forthcoming at Information Systems Research, Available at SSRN: https://ssrn.com/abstract=4483615

Wen Wang (Contact Author)

University of Maryland - Robert H. Smith School of Business ( email )

Beibei Li

Carnegie Mellon University - H. John Heinz III School of Public Policy and Management ( email )

Pittsburgh, PA 15213-3890
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
571
Abstract Views
2,399
Rank
119,474
PlumX Metrics