Resolved Self-Play - How do the agents know who the teammates are?

MidnightGameDeveloper · Sep 30, 2020

Hello,

I am trying to train intelligent adversaries using self-play. What I don't understand is how the agents know which tag (detected by raycast) belongs to the enemies and which are teammates since the teams that are learning getting switched during training. Do the agents know their Team ID or how do they learn who are teammates and who not?

I am also wondering if there is a way to see which team is currently training?

Are there any recommendations on which value should be set for stacked vectors for a quite dynamic game/environment where the agents drive cars? (I am using PPO)

henrypeteet · Sep 30, 2020

Thanks for the questions.

Going off of the soccer example one way to represent symmetric teams is to tag each team's agents and include that tag under "Detectable Tags". But make sure to swap the input ordering based on team. In the soccer example it winds up looking like this for each agent, notice the flipped entries.

Blue:

Purple

The team ID is not fed into the network by default

I don't think there is a direct way to check which team is training but you can derive it from the number of steps taken and the value you set for `team_change` in your .yaml config

current_team = floor(steps / team_change) % num_teams

I don't think we have a recommended value specifically for driving but others on the forum may be of more use.

MidnightGameDeveloper · Oct 1, 2020

henrypeteet said: ↑

Thanks for the questions.

Going off of the soccer example one way to represent symmetric teams is to tag each team's agents and include that tag under "Detectable Tags". But make sure to swap the input ordering based on team. In the soccer example it winds up looking like this for each agent, notice the flipped entries.

Blue: View attachment 708510

Purple View attachment 708663

The team ID is not fed into the network by default

I don't think there is a direct way to check which team is training but you can derive it from the number of steps taken and the value you set for `team_change` in your .yaml config

current_team = floor(steps / team_change) % num_teams

I don't think we have a recommended value specifically for driving but others on the forum may be of more use.

Click to expand...

Hello,
thank you for taking the time to answer my questions. I knew I must have missed something. Swapping the order of the "Detectable tags"-List based on the team makes total sense, I hope that this will improve the training in my project .

Search Unity

Resolved Self-Play - How do the agents know who the teammates are?

MidnightGameDeveloper

henrypeteet

Unity Technologies

MidnightGameDeveloper

Search Unity

Unity ID

Useful Searches

Resolved Self-Play - How do the agents know who the teammates are?

MidnightGameDeveloper

henrypeteet

Unity Technologies

MidnightGameDeveloper