Drive More Effective Data-Based Innovations: Enhancing the Utility of Secure Databases

Yi Qian

Northwestern University - Kellogg School of Management

Hui Xie

University of Illinois

October 2013

NBER Working Paper No. w19586

Databases play a central role in evidence-based innovations in business, economics, social, and health sciences. In modern business and society, there are rapidly growing demands for constructing analytically valid databases that also are secure and protect sensitive information in order to meet customer and public expectations, to minimize financial losses, and to comply with privacy regulations and laws. We propose new data perturbation and shuffling (DPS) procedures, named MORE, for this purpose. As compared with existing DPS methods, MORE can substantially increase the utility of secure databases without increasing disclosure risk. MORE is capable of preserving important nonmonotonic relationships among attributes, such as the inverted-U relationship between competition and innovation. Maintaining such relationships is often the key to determining optimal levels of policy and managerial interventions. MORE does not require data to be of particular types or have particular distributional shapes. Instead, it provides unified, flexible, and robust algorithms to mask general types of confidential variables with arbitrary distributions, thereby making it suitable for general-purpose data masking. Since MORE nests the commonly used generalized linear models as special cases, a much wider range of statistical analyses can be conducted using the secure databases with results similar to those using the original databases. Unlike existing DPS approaches which typically require a joint model for all variables, MORE requires no modeling of nonconfidential variables, and thus further increases the robustness of secure databases. Evaluation of MORE through Monte Carlo simulation studies and empirical applications demonstrates that it performs better than existing data masking methods.

Number of Pages in PDF File: 38

Open PDF in Browser Download This Paper

Date posted: October 25, 2013  

Suggested Citation

Qian, Yi and Xie, Hui, Drive More Effective Data-Based Innovations: Enhancing the Utility of Secure Databases (October 2013). NBER Working Paper No. w19586. Available at SSRN: https://ssrn.com/abstract=2345080

Contact Information

Yi Qian (Contact Author)
Northwestern University - Kellogg School of Management ( email )
2001 Sheridan Road
Evanston, IL 60208
United States
Hui Xie
University of Illinois ( email )
1200 W Harrison St
Chicago, IL 60607
United States
Feedback to SSRN

Paper statistics
Abstract Views: 365
Downloads: 6
Paper comments
No comments have been made on this paper