The Hong Kong University of Science and Technology
Endogeneity, Markov Decision Process, Instrumental Variable, Reinforcement Bias, Reinforcement Learning, Q-Learning, Actor-Critic, Stochastic Approximation
Self-fulfilling Bias, Dynamic Selection, Endogeneity Spillover, Contextual Multi-armed Bandit Model
newsvendor; pricing; risk hedging; mean-variance framework