Hello everyone, im facing a strange behavior while training with multiple agents or multiple environments. This problem does not occur when training with a single agent: Whenever I start a training, everything works fine at the beginning. However after the second summary checkpoint is reached, the agents get stuck for a couple of seconds/minutes. Then suddenly the training continues and the output looks like this (In this example i train with 8 Environments): Some Informations about the environment: Im using a custom environment where everything is controlled by a counter (Almost like in the Gridworld example, but I count the actions taken and I set a Threshold for the maximum allowed actions/steps). A Decision is requested manually when a courotine has finished a webrequest (Agents are async). When the counter reaches a threshold the agent failed and EpisodeInterrupted() gets called. If the agents reaches the goal EndEpisode() gets called. The training with multiple agents still works. The result is a working Model, however these pausing caused by the summaries are wasting a lot of processing time. When i disable the summaries everything works fine, but without summaries its hard to check which training was the most successful.