In this project, the withe ball aims to reach the green box without collision with these red obstacles. These red obstacles move in this area randomly. I tried a lot of times with different reward functions (dense/sparse), and different observations such as grid sensors and ray cast sensors. I can not get good results. Can someone help me with this? I wonder if ppo can not handle this kind of randomized obstacles. Thank you very much!!!!!