Hello all, I do not understand how to set up Self-play rewards. I am designing a boxing game, every hit should give points. But having only positive rewards leads to consistent increase in ELO, I wont mess with negative rewards. So how I just say: if rewards of agent1 > of agent2 agent, a1 wins? ELO should have nothing to do with rewards.