Piazza Leonardo da Vinci
Milan, 20100
Italy
Polytechnic University of Milan
Human-in-the-Loop optimization, Reinforcement Learning from Human Feedback (RLHF), Preferential Bayesian Optimization (PBO), Active learning, Preference-based optimization, Large Language Models (LLMs), High-dimensional optimization.