Question: While training, ELO goes down at team swap

Discussion in 'ML-Agents' started by KaushalAgrawal, Oct 13, 2023.

    Dec 18, 2019
    I am developing a turn-based, 4-player card game (2 teams of 2).
    I have created 2 groups, each with 2 agents that share the same behaviour parameters and the same team id.
    That is:
    Group A - agent0 (team id=0), agent2 (team id=0)
    Group B - agent1 (team id=1), agent3 (team id=1)
    This is a partner-based game.

    Group reward:
    1 - (turns taken to win / 100f) to the winning group
    -1 to the losing group
    0 when the game is a draw

    Agent reward:
    -0.001 if the agent tries to play a card it doesn't have.
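
    To make the reward scheme above concrete, here is a minimal Python sketch of the same arithmetic. It is not my actual Unity code; the names group_rewards, invalid_play_penalty and INVALID_CARD_PENALTY are made up for illustration, and only the numbers come from the description above.

```python
# Hypothetical helper names; only the reward values come from the post.
INVALID_CARD_PENALTY = -0.001


def group_rewards(winner, turns_taken):
    """Return (reward_group_a, reward_group_b) for a finished game.

    winner is "A", "B", or None for a draw.
    """
    if winner is None:
        return 0.0, 0.0
    # The winner's reward shrinks linearly with the number of turns taken.
    win_reward = 1.0 - turns_taken / 100.0
    if winner == "A":
        return win_reward, -1.0
    return -1.0, win_reward


def invalid_play_penalty(hand, card):
    """Per-agent penalty for trying to play a card that is not in hand."""
    return INVALID_CARD_PENALTY if card not in hand else 0.0
```

    With this formula the winner's reward reaches 0 at 100 turns, while the loser always receives -1, so the two group rewards do not sum to zero.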

    While training, ELO drops initially, with the group mean reward being negative; later, up to 200000 steps, it increases.
    Then after 200000 steps (team swap is at 200000), ELO decreases again to below 1200.
    Then it goes up again at 400000 steps, and so on...
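
    For reference when reading the curve, this is a sketch of a textbook ELO update in Python. It is an assumption about the general formula, not a copy of the ML-Agents internals; the K-factor of 16 and the function names are illustrative, and 1200 is simply the baseline mentioned above.

```python
def expected_score(rating, opponent_rating):
    """Expected result (between 0 and 1) under the standard ELO model."""
    return 1.0 / (1.0 + 10.0 ** ((opponent_rating - rating) / 400.0))


def update_elo(rating, opponent_rating, result, k=16.0):
    """Return the new rating; result is 1.0 for a win, 0.5 draw, 0.0 loss.

    k is an assumed K-factor chosen for illustration.
    """
    return rating + k * (result - expected_score(rating, opponent_rating))
```

    Under this model a rating only falls when the results are worse than expected against the current opponent, so a drop right after the swap means the team that just started learning is losing more often than its rating predicts.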

    Please guide me as to where and what I am doing wrong, and how I could fix this so as to get an increasing ELO graph.

    ML-agents 20 (0.30.0)
    Torch 1.9.1