Search Unity

Resolved Adding self-play in mid training?

Discussion in 'ML-Agents' started by mbaske, Mar 24, 2021.

  1. mbaske

    mbaske

    Joined:
    Dec 31, 2017
    Posts:
    473
    Hi, my agent requires an initial round of training for learning some physics related behaviour, like self-balancing. Once it has that figured out, it needs to compete against others. Apparently it's possible to start training without self-play, pause at some point, add self-play params to the config yaml and then resume without getting any errors. I just wanted to make sure this is a viable option and self-play will work as expected if it is added later on like this. Thanks!
     
  2. christophergoy

    christophergoy

    Joined:
    Sep 16, 2015
    Posts:
    735
    My intuition is that it should be OK, since self-play just pits policies against one another and doesn’t change the architecture of the network. I’ll poke our research guys and get back to you.
     
    mbaske likes this.
  3. christophergoy

    christophergoy

    Joined:
    Sep 16, 2015
    Posts:
    735
    confirmed. it's ok.
     
  4. mbaske

    mbaske

    Joined:
    Dec 31, 2017
    Posts:
    473
    Great, thank you!