A Unity ID allows you to buy and/or subscribe to Unity products and services, shop in the Asset Store and participate in the Unity community.
Separate names with a comma.
Episodes are local to the agent, so calling EndEpisode on one agent should not affect others.
Here's the thing: most ML algos are very complicated to implement. On-top of that they're very difficult to make computationally efficient....
What's strange is that this bottleneck only actually occurs when training. According to reports from rz_0lento, the 8 parallel environment setup...
AFAIK: You can fit multiple sequences within a single batch and learn from that, but since batch is the chunk size of state-action pairs used for...
Another ml-agents user in the discord followed the repro steps (mlagents version 14) and confirmed the problem. EDIT: I went ahead and opened up...
It sounds like the problem is with your scene. There is no way you are building up more than a gig of observations...
Did a bit of playing around, the issue is definitely a strange one. A sloppy fix to cut through the strangeness is simply adding a...
I think I found the final ingredient that should let you reproduce this issue in the 3D ball env: 1: Add a single 3DBallHardNew agent to the...
Yep, am using different behavior names. I will try re-installing at some point today I think. The agents have their own prefab. The prefabs are...
That's for the low-level python API I think. If you're using the terminal to launch training, try out this command: mlagents-learn --help Time...
This is exactly the problem. I am indeed requesting decisions for both agents, and both agent classes work fine if used individually. The sole...
If you're doing anything in Update rather than FixedUpdate (and are using physics at all) then you will probably see deviations...
You can definitely implement your own models using TF2, but it isn't simple to set up. Look at the models.py file and go from there (I can't...
I am trying to train two different sets of agents in the same environment, but at training time, only one behavior parameter will be loaded in the...
If you are using visual observations then this can be a cause of large buffer sizes, but there should be an upper limit to how much data is being...
Training does not necessarily use fixedDeltaTime to govern its updates, so training may not appear to be the same speed as when you inference on...
Only the final encoded result. That sounds plausible, but I'm not at all sure if thin features are sometimes biased out or not. to little...
Yup, sorry about that.
Nope. You could never define a "take over the world reward function" correctly. Only an evolutionary approach could write something so...
The problem is that if you pool and convolve too many times on a small image, it will end up with negative or puny dimensions on the other end....