Proximal Policy Optimization Algorithm for Dynamic Pricing with Online Reviews

25 Pages Posted: 2 Aug 2022

See all articles by Chao Wu

Chao Wu

Central South University

bi wenjie

Central South University

Haiying Liu

Hunan University of Finance and Economics

Abstract

This study investigates whether the presence of both quality- and value-based online reviews help firms make decisions. To adapt to a complex real-world environment, we construct two simulated environments with high and low initial consumer-perceived quality and employ a Proximal Policy Optimization algorithm (PPO) to derive optimal pricing strategies. The simulation results show that retailers can gain higher revenue by considering quality-based reviews only when the consumers' initial perceived quality is low. In addition, retailers must choose an appropriate promotion method based on the social learning speed of the consumer group. When the social learning speed is slow, retailers should invest more in promotion costs to improve the initial perceived quality of consumers and thus increase revenue. Compared to the Advantage Actor-Critic algorithm, the PPO algorithm exhibits better performance, provides a new approach for complex and continuous revenue management problems, and can be applied to a wider range of areas. Keywords

Keywords: Social Learning, Proximal Policy Optimization Algorithm, Advantage Actor-Critic Algorithm, Dynamic Pricing, Quality-based Review, Value-based Review

Suggested Citation

Wu, Chao and wenjie, bi and Liu, Haiying, Proximal Policy Optimization Algorithm for Dynamic Pricing with Online Reviews. Available at SSRN: https://ssrn.com/abstract=4179218 or http://dx.doi.org/10.2139/ssrn.4179218

Chao Wu

Central South University ( email )

Changsha, 410083
China

Bi Wenjie

Central South University ( email )

Haiying Liu (Contact Author)

Hunan University of Finance and Economics ( email )

China

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
216
Abstract Views
739
Rank
306,402
PlumX Metrics