No. 38 Xueyuan Road
Haidian District
Beijing, Beijing 100871
China
reusable resources, online resource allocation, episodic reinforcement learning