A 5x5 array is the same as a single array with 25 entries. You can just loop over your 5x5 array and add each entry as an observation.
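To make the flattening idea concrete, here is a minimal sketch in plain Python (not ML-Agents code). The names `flatten_grid` and `grid` are made up for illustration, and the grid values are dummy data. In Unity you would do the same loop in C# inside your agent's observation callback and add each value to the sensor.

```python
def flatten_grid(grid):
    """Turn a 2D grid (list of rows) into one flat list of float observations."""
    observations = []
    for row in grid:
        for cell in row:
            # Each cell becomes exactly one observation entry.
            observations.append(float(cell))
    return observations


# Hypothetical 5x5 board filled with dummy values 0..24.
grid = [[r * 5 + c for c in range(5)] for r in range(5)]
obs = flatten_grid(grid)
print(len(obs))
```

The order of the loop does not matter to the network as long as it stays the same on every step, so the observation size has to be set to 25 either way.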
Your mean reward is also really high. Are you rewarding the agent a lot? Are the individual rewards large (reward > 1)? Are you having a small...
I solved it in the end by using only discrete actions and action masks. Since all discrete branches always return a value, I double...
I tried to get the agents working for the game Splendor, a turn-based card/board game. It is still a work in progress with not all...
You could just add a negative reward if the agent moves too far away from it, to stop it from straying too far. Invisible walls would solve at least...
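A rough sketch of the negative-reward idea, again in plain Python rather than ML-Agents code. Everything here is an assumption for illustration: the function name `distance_penalty`, the `anchor_pos` the agent should stay close to, and the values for `max_distance` and `penalty` would all need tuning for the actual environment.

```python
import math


def distance_penalty(agent_pos, anchor_pos, max_distance=10.0, penalty=-0.01):
    """Return a small negative reward once the agent is too far from the anchor.

    max_distance and penalty are placeholder values, not recommendations.
    """
    distance = math.dist(agent_pos, anchor_pos)
    # Only penalize when the agent is outside the allowed radius.
    return penalty if distance > max_distance else 0.0


# Inside the radius: no penalty. Outside: the per-step penalty applies.
near = distance_penalty((0.0, 0.0), (3.0, 4.0))
far = distance_penalty((0.0, 0.0), (30.0, 40.0))
print(near, far)
```

Keeping the penalty small per step matters, otherwise it can drown out the reward for the actual task.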
I'm trying to use ML-Agents for a simplified game of Splendor, and I have a problem with how to set up the correct behaviours to do a turn....