Abstract

http://ssrn.com/abstract=2345080
 


 



Drive More Effective Data-Based Innovations: Enhancing the Utility of Secure Databases


Yi Qian


Northwestern University - Kellogg School of Management

Hui Xie


University of Illinois

October 2013

NBER Working Paper No. w19586

Abstract:     
Databases play a central role in evidence-based innovations in business, economics, social, and health sciences. In modern business and society, there are rapidly growing demands for constructing analytically valid databases that also are secure and protect sensitive information in order to meet customer and public expectations, to minimize financial losses, and to comply with privacy regulations and laws. We propose new data perturbation and shuffling (DPS) procedures, named MORE, for this purpose. As compared with existing DPS methods, MORE can substantially increase the utility of secure databases without increasing disclosure risk. MORE is capable of preserving important nonmonotonic relationships among attributes, such as the inverted-U relationship between competition and innovation. Maintaining such relationships is often the key to determining optimal levels of policy and managerial interventions. MORE does not require data to be of particular types or have particular distributional shapes. Instead, it provides unified, flexible, and robust algorithms to mask general types of confidential variables with arbitrary distributions, thereby making it suitable for general-purpose data masking. Since MORE nests the commonly used generalized linear models as special cases, a much wider range of statistical analyses can be conducted using the secure databases with results similar to those using the original databases. Unlike existing DPS approaches which typically require a joint model for all variables, MORE requires no modeling of nonconfidential variables, and thus further increases the robustness of secure databases. Evaluation of MORE through Monte Carlo simulation studies and empirical applications demonstrates that it performs better than existing data masking methods.

Institutional subscribers to the NBER working paper series, and residents of developing countries may download this paper without additional charge at www.nber.org.

Number of Pages in PDF File: 38

working papers series


Date posted: October 25, 2013  

Suggested Citation

Qian, Yi and Xie, Hui, Drive More Effective Data-Based Innovations: Enhancing the Utility of Secure Databases (October 2013). NBER Working Paper No. w19586. Available at SSRN: http://ssrn.com/abstract=2345080

Contact Information

Yi Qian (Contact Author)
Northwestern University - Kellogg School of Management ( email )
2001 Sheridan Road
Evanston, IL 60208
United States
Hui Xie
University of Illinois ( email )
Chicago, IL 60612
United States
Feedback to SSRN


Paper statistics
Abstract Views: 80
Downloads: 2

© 2014 Social Science Electronic Publishing, Inc. All Rights Reserved.  FAQ   Terms of Use   Privacy Policy   Copyright   Contact Us
This page was processed by apollo6 in 1.141 seconds