affiliation not provided to SSRN
deep reinforcement learning, capacitated lot sizing, non-stationary demand