Eindhoven University of Technology (TUE)
Deep Reinforcement Learning, Proximal Policy Optimization, Multi-Echelon, Inventory Control, Backorders
deep reinforcement learning, capacitated lot sizing, non-stationary demand