A Unity ID allows you to buy and/or subscribe to Unity products and services, shop in the Asset Store and participate in the Unity community.
Separate names with a comma.
I would start simple. In this case, I would - limit the episode length, but give 1/"maximum episode length" as penalty every step (to encourage...
My answer is kind of late, sorry. Think about it this way: your observations will always be collected and then processed all at once. So it will...
I saw a video on YouTube on old version of mlagents where an episode length of 1 caused problems. Can you try making your episode at least 2 steps...
mlagents is probably processing all of the collected observations and training the neural net. During that time it can't provide Unity with new...
Oh, SAC? I have no intuition about that algorithm, and don't have a clue what the problem could be, sorry.
Can you please show us your config-file? Also, how many (combined) agents do you have across those 8 environments?
I just ran some benchmarks on my environment a few days ago. I reduced the max_steps in config.yaml to a small number so the training will...
Hello ruoping, thank you for the answers. Yeah, I had a few misconceptions about mlagents when I wrote that post, but I think I understand now....
Nvm, I'm not on the newest release and there have already been changes to this method in https://github.com/Unity-Technologies/ml-agents/pull/5192/
Hello everyone, I was looking at the timers.json-file from my runs, specifically at the "_update_policy" section. "_update_policy" only has 1...
I skimmed over the examples and "WallJump" has the following default-settings:...
Hello everyone, I could use some insights into how ml-agents deals with trajectories that include observations from a previous policy (in my...
Nevermind, it's probably the (average) number of agents that reached a certain reward at the end of the episode. No idea why I didn't think of...
Release 13 added the "histogram" graph to tensorboard. But I can't figure out how the graph is connected to the generated rewards. Take this...
Had a problem with h5py yesterday as well. Solved it by using python 3.7.
Yes! I only discovered it recently, but I find it *very* helpful.
Hello everyone, I've been tweaking my environment and playing with hyperparameters for quite some time, and finally reached a point where I'm...
Hello everyone, this is a question to those users, who've managed to build up intuition about reading tensorboard-graphs. Or maybe those that...
Hello everyone, is there an easy way to step through the source-code of ML-agents while debugging? Let's say I want to see the code of Agent.cs....
There is a bunch of links here...