Actor-critic multi-objective reinforcement learning for non-linear utility functions April 2023 · Mathieu Reymond