An Analytical Study of Utility Functions in Multi-Objective Reinforcement Learning

Rodriguez-Soto, Manel; Rodriguez-Aguilar, Juan A.; Lopez-Sanchez, Maite

An Analytical Study of Utility Functions in Multi-Objective Reinforcement Learning

Part of Advances in Neural Information Processing Systems 37 (NeurIPS 2024) Main Conference Track

Bibtex Paper

Authors

Manel Rodriguez-Soto, Juan A. Rodriguez-Aguilar, Maite Lopez-Sanchez

Abstract

Multi-objective reinforcement learning (MORL) is an excellent framework for multi-objective sequential decision-making. MORL employs a utility function to aggregate multiple objectives into one that expresses a user's preferences. However, MORL still misses two crucial theoretical analyses of the properties of utility functions: (1) a characterisation of the utility functions for which an associated optimal policy exists, and (2) a characterisation of the types of preferences that can be expressed as utility functions. As a result, we formally characterise the families of preferences and utility functions that MORL should focus on: those for which an optimal policy is guaranteed to exist. We expect our theoretical results to promote the development of novel MORL algorithms that exploit our theoretical findings.

An Analytical Study of Utility Functions in Multi-Objective Reinforcement Learning

Authors

Abstract

Name Change Policy