Yijie Peng

Peking University

No 5 Yiheyuan Rd

Haidian District

Beijing, Beijing 100871

China

SCHOLARLY PAPERS

9

DOWNLOADS

709

SSRN CITATIONS

4

CROSSREF CITATIONS

1

Scholarly Papers (9)

1.

A New Likelihood Ratio Method for Training Artificial Neural Networks

Number of pages: 33 Posted: 07 Feb 2019 Last Revised: 11 May 2021
Peking University, Chinese Academy of Sciences (CAS), VU University Amsterdam, Hong Kong University of Science & Technology (HKUST) and Columbia University
Downloads 193 (240,646)

simulation, stochastic gradient estimation, artificial neural network, image identification

2.

Multi-Agent Deep Reinforcement Learning for Multi-Echelon Inventory Management

Rotman School of Management Working Paper No. 4262186
Number of pages: 39 Posted: 05 Dec 2022
Xiaotian Liu, Ming Hu, Yijie Peng and Yaodong Yang
Peking University, University of Toronto - Rotman School of Management, Peking University and Peking University
Downloads 191 (245,219)

Multi-Echelon Inventory Management, Multi-Agent Deep Reinforcement Learning, Bullwhip Effect

3.

A Stochastic Approximation Method for Simulation-Based Quantile Optimization

Number of pages: 40 Posted: 11 May 2021
Jiaqiao Hu, Yijie Peng, Gongbo Zhang and Qi Zhang
State University of New York (SUNY) - Stony Brook, Students, Peking University, affiliation not provided to SSRN and State University of New York (SUNY) - Stony Brook, Students
Downloads 125 (343,739)
Citation 1

Quantile sensitivities, stochastic approximation, simulation optimization

4.

Sensitivity Analysis of Portfolio Credit Derivatives by Conditional Monte Carlo Simulation

Number of pages: 26 Posted: 20 Jun 2019
Lei Lei, Yijie Peng, Michael Fu and Jianqiang Hu
Chongqing University, Peking University, University of Maryland - College Park and Fudan University
Downloads 80 (462,079)
Citation 4

simulation, stochastic gradient estimation, credit derivative, conditional Monte Carlo, copula model

5.

Estimating Confidence Intervals and Regions for Quantiles by Monte Carlo Simulation

Number of pages: 46 Posted: 18 Nov 2021
Chongqing University, Georgia Institute of Technology, Peking University and affiliation not provided to SSRN
Downloads 45 (616,260)

Monte Carlo simulation, quantiles, confidence intervals, confidence regions, sensitivity analysis

6.

Optimal Unbiased Estimation for Expected Cumulative Cost

Stevens Institute of Technology School of Business Research Paper
Number of pages: 13 Posted: 30 Apr 2018
Stevens Institute of Technology - School of Business, University of Maryland - College Park, Peking University and Florida State University
Downloads 30 (701,657)
Citation 1

unbiased simulation, optimal control, cumulative cost, efficiency of estimator

7.

Online Appendix for 'Efficient Sampling Policy for Selecting a Subset with the Best'

IEEE Transactions on Automatic Control
Number of pages: 13 Posted: 22 Feb 2022 Last Revised: 15 Sep 2022
Guanghua School of Management, Peking University, National University of Defense Technology - College of Systems Engineering, Tsinghua University - Department of Automation and Peking University
Downloads 29 (708,585)

Ranking and selection, sequential sampling, stochastic control, Bayesian, subset selection

8.

Training Deep Q-Network via Monte Carlo Tree Search for Adaptive Bitrate Control in Video Delivery

Number of pages: 40 Posted: 11 Dec 2022
Xiaotian Liu, Henry Lam and Yijie Peng
Peking University, Columbia University and Peking University
Downloads 13 (837,119)

video delivery, reinforcement learning, deep Q-network, Monte Carlo tree search, common random number

9.

Online Appendix for 'Efficient Learning for Selecting Top-m Context-Dependent Designs'

Number of pages: 8 Posted: 04 Jan 2023
Gongbo Zhang, Sihua Chen and Yijie Peng
Guanghua School of Management, Peking University, Independent and Peking University
Downloads 3 (926,753)

Simulation optimization, context-dependent decision, top-m selection, dynamic sampling, asymptotic optimality