|
||||
|
||||
Model Selection Using Database Characteristics: Classification Methods and an Application to the 'HMM and Its Children'Eric M. SchwartzUniversity of Pennsylvania - Marketing Department Eric BradlowUniversity of Pennsylvania - Marketing Department Peter FaderUniversity of Pennsylvania - Marketing Department June 17, 2012 Abstract: When managers and researchers encounter a dataset, they typically ask two key questions: (1) which model (from a candidate set) should be used? and (2) if I use a particular model, when is it going to likely work well for my business goal? This research addresses those two questions, and provides a rule for data analysts to portend the "winning model" before having to fit any of them. We characterize datasets based on managerially relevant (and easy-to-compute) summary statistics, and we use classification techniques from machine learning to provide a decision tree that recommends when to use which model. We illustrate this method for a common marketing problem (i.e., forecasting repeat purchasing for a cohort of new customers) and demonstrate the method's ability to discriminate among an integrated family of probability models that we call the "HMM and its children." We observe a strong ability for dataset characteristics to guide the choice of the most appropriate model, and observe that some model features (e.g., the "back-and-forth" migration between latent states) are more important to accommodate than others (e.g., the inclusion of an "off" state with no activity). We also demonstrate the method's broad generality by providing directions for researchers to replicate this kind of model classification task in other managerial contexts (outside of repeat purchasing and the HMM framework).
Number of Pages in PDF File: 55 Keywords: data science, business intelligence, model selection, machine learning, classification tree, posterior predictive model checking, hidden Markov models, hierarchical Bayesian methods, random forests, forecasting JEL Classification: C11, C15, C22, C23, C51, C52, C53, M31 working papers seriesDate posted: June 18, 2012Suggested CitationContact Information
|
|
|||||||||||||||||||||||||||||
© 2013 Social Science Electronic Publishing, Inc. All Rights Reserved.
FAQ
Terms of Use
Privacy Policy
Copyright
This page was processed by apollo4 in 1.078 seconds