When I train the agent to learn in my own environment, in the beginning, it works well, trainning is fast. But after a few steps the trainng is becoming more and more slow. It took me more than 1000s to summary 10000 steps. Why would this happened?