I'm not sure if anyone can help, but I have a specific question that I wasn't able to answer by digging through the code: https://github.com/Unity-Technologi...ents/trainers/trainer_controller.py#L180-L181

Why is trainer_controller.reset_env_if_ready called inside an n_steps loop? As I understand it, n_steps is the number of distinct UnityEnvWorkers that have stepped through their respective environments (relevant when training with num-envs > 1). Also, there is no check for which env_worker actually needs a parameter update or reset; the call simply resets all env_workers.

Any insight? Thanks in advance.