Multi-Task Best Arm Identification with Risk Constraints

43 Pages Posted: 1 May 2025

Date Written: October 01, 2024

Abstract

Best Arm Identification (BAI) is a challenging problem in sequential decision-making with numerous realworld applications. Existing approaches typically assume all arms are feasible or address expectation-based constraints, often relying on strong assumptions, loose sample complexity bounds, and non-optimal solutions. This paper introduces a multi-task BAI problem with risk constraints in the fixed-confidence setting, where each arm is evaluated across multiple performance metrics. The agent seeks to optimize one metric while ensuring that the quantiles of other metrics remain below specified thresholds for each task. We first derive a tight, instance-dependent lower bound on sample complexity. Using this bound, we define optimality conditions for the static optimal sampling ratio, demonstrating how it balances trade-offs between tasks, constraints, and the interplay of optimality and feasibility. We then propose a Track-and-Stop strategy with asymptotically optimal sample complexity, complemented by a computationally efficient strategy that iteratively refines the optimality conditions. Numerical experiments confirm that our algorithm performs competitively.

Keywords: Best arm identification, risk constraints, fixed-confidence, sample complexity, multi-task

Suggested Citation

HU, Mingjie and Hu, Jianqiang, Multi-Task Best Arm Identification with Risk Constraints (October 01, 2024). Available at SSRN: https://ssrn.com/abstract=5214504 or http://dx.doi.org/10.2139/ssrn.5214504

Mingjie HU (Contact Author)

Fudan University ( email )

Jianqiang Hu

Fudan University ( email )

670 Guoshun Road
Siyuan Building, Room 508
Shanghai, 200433
China

HOME PAGE: http://www.fdsm.fudan.edu.cn/en/teacher/preview.aspx?UID=91946

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
9
Abstract Views
50
PlumX Metrics