Actor-Critic Multi-Objective Reinforcement Learning for Non-Linear Utility Functions July 2021 · Mathieu Reymond