Multi-criteria/multi-objective reward: adaptive weights?

Discussion in 'ML-Agents' started by ucax217, May 14, 2022.

  1. ucax217

    Joined: Nov 8, 2018
    Posts: 8
    Hi,

    I have an agent whose reward is computed from several terms (criteria/objectives).

    I have set up curriculum learning by changing the weight of each criterion over time. At first, the weights focus on the easy, essential behaviors; later they shift to give more importance to the details that improve quality.
    It helps a little, but not enough.
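
    To make the setup concrete, here is a minimal sketch of the kind of weight schedule described above. It is independent of ML-Agents, and everything in it is an assumption for illustration: the criterion names, the weight values, and the linear interpolation between an "early" and a "late" weight set are all hypothetical.

```python
from typing import Dict

# Hypothetical criterion names and weights, chosen only for illustration.
EARLY_WEIGHTS = {"reach_target": 1.0, "smoothness": 0.0, "energy": 0.0}
LATE_WEIGHTS = {"reach_target": 0.5, "smoothness": 0.3, "energy": 0.2}


def curriculum_weights(progress: float) -> Dict[str, float]:
    """Linearly interpolate between the early and late weight sets.

    progress is the fraction of training completed, clamped to [0, 1].
    """
    t = min(max(progress, 0.0), 1.0)
    return {
        name: (1.0 - t) * EARLY_WEIGHTS[name] + t * LATE_WEIGHTS[name]
        for name in EARLY_WEIGHTS
    }


def scalar_reward(terms: Dict[str, float], weights: Dict[str, float]) -> float:
    """Collapse the per-criterion reward terms into one scalar (weighted sum)."""
    return sum(weights[name] * terms[name] for name in weights)


if __name__ == "__main__":
    # Example: reward terms for one step, evaluated halfway through training.
    step_terms = {"reach_target": 1.0, "smoothness": -0.5, "energy": -1.0}
    print(scalar_reward(step_terms, curriculum_weights(0.5)))
```

    The schedule here is fixed in advance, which is exactly the limitation in question: the weights depend only on training progress, not on how well each criterion is actually being satisfied.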

    Is there an algorithm in ML-Agents that automatically adapts the weights of a multi-objective reward?

    I have found some papers on the topic, such as this one:
    Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control
    Jie Xu, Yunsheng Tian, Pingchuan Ma, Daniela Rus, Shinjiro Sueda, Wojciech Matusik
    International Conference on Machine Learning (ICML) 2020
    http://pgmorl.csail.mit.edu/

    Thank you