A Missing Data Paradox for Nearest Neighbor Recommender Systems

Daniel M. Fleder
University of Pennsylvania - The Wharton School

Kartik Hosanagar
University of Pennsylvania - The Wharton School


October 1, 2007


Abstract:
Recommender systems typically work over sparse matrices. Most methods assume that the entries of these matrices are missing at random (MAR), yet in practice entries are often not missing at random (NMAR). How problematic is this? We present a puzzle. Some methods explicitly account for NMAR processes, and this has been shown to improve predictions. Many methods, however, assume MAR. While that assumption may be wrong, we show that these methods may nonetheless benefit from its being violated. Given that some data must go missing, an NMAR process can often pick the "right" values to preserve (i.e., it preserves the more informative data). Thus, despite the perception that NMAR is harmful, it can often improve recommendations. This may explain some of the historical success of collaborative filtering even when the MAR assumption has been violated.

Keywords: recommender systems, collaborative filtering, predictive modeling, missing data
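
The comparison described in the abstract can be illustrated with a toy simulation. The sketch below is not the authors' experiment. It assumes a hypothetical low-rank rating model, an assumed NMAR process in which higher ratings are more likely to be observed, and a basic user-based nearest neighbor predictor with cosine similarity. All function names and parameters (make_ratings, sample_mask, knn_predict, compare, beta) are illustrative. Both regimes keep the same number of entries, so any difference in error reflects which entries survive, not how many.

```python
import numpy as np

def make_ratings(n_users=60, n_items=40, n_factors=3, seed=0):
    """Dense 'true' ratings from a hypothetical low-rank model plus noise."""
    rng = np.random.default_rng(seed)
    users = rng.normal(size=(n_users, n_factors))
    items = rng.normal(size=(n_items, n_factors))
    return users @ items.T + 0.1 * rng.normal(size=(n_users, n_items))

def sample_mask(R, candidates, n_keep, rng, beta=0.0):
    """Observe n_keep of the candidate cells (flat indices into R).

    beta = 0 samples uniformly (MAR-style); beta > 0 makes high ratings
    more likely to be observed (one assumed NMAR process among many).
    """
    weights = np.exp(beta * R.ravel()[candidates])
    kept = rng.choice(candidates, size=n_keep, replace=False,
                      p=weights / weights.sum())
    mask = np.zeros(R.size, dtype=bool)
    mask[kept] = True
    return mask.reshape(R.shape)

def knn_predict(R, mask, u, i, k=10):
    """Predict R[u, i] from the k most similar users who rated item i."""
    raters = np.flatnonzero(mask[:, i])
    raters = raters[raters != u]
    sims = np.zeros(raters.size)
    for j, v in enumerate(raters):
        common = mask[u] & mask[v]          # items rated by both users
        if common.sum() >= 2:
            a, b = R[u, common], R[v, common]
            denom = np.linalg.norm(a) * np.linalg.norm(b)
            if denom > 0:
                sims[j] = a @ b / denom     # cosine similarity
    top = np.argsort(-sims)[:k]
    w = np.clip(sims[top], 0.0, None)       # ignore negative similarities
    if w.sum() == 0:
        return float(R[mask].mean())        # fall back to the observed mean
    return float(w @ R[raters[top], i] / w.sum())

def compare(seed=0, n_test=200, n_keep=600, beta=1.0):
    """RMSE on a uniformly drawn test set under each missingness regime."""
    R = make_ratings(seed=seed)
    rng = np.random.default_rng(seed + 1)
    cells = rng.permutation(R.size)
    test, candidates = cells[:n_test], cells[n_test:]
    results = {}
    for name, b in (("MAR", 0.0), ("NMAR", beta)):
        mask = sample_mask(R, candidates, n_keep, rng, beta=b)
        errs = [knn_predict(R, mask, *divmod(c, R.shape[1])) - R.flat[c]
                for c in test]
        results[name] = float(np.sqrt(np.mean(np.square(errs))))
    return results

if __name__ == "__main__":
    print(compare())
```

Which regime yields the lower error depends on the assumed rating model, the form of the NMAR process, and the evaluation set, so the output of this sketch should not be read as confirming or refuting the paper's result. Varying beta and n_keep is one way to explore when selective observation helps or hurts a nearest neighbor predictor.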

Working Paper Series

Date posted: January 04, 2009; Last revised: January 04, 2009

Suggested Citation

Fleder, Daniel M. and Hosanagar, Kartik, A Missing Data Paradox for Nearest Neighbor Recommender Systems (October 1, 2007). Available at SSRN: http://ssrn.com/abstract=1322548



Contact Information

Daniel M. Fleder (Contact Author)
University of Pennsylvania - The Wharton School
Philadelphia, PA 19104
United States

Kartik Hosanagar
University of Pennsylvania - The Wharton School
3641 Locust Walk
Philadelphia, PA 19104-6365
United States


