Search Unity

Question Why decision_steps in unityenvironment suddenly get nothing?

Discussion in 'ML-Agents' started by Dream_Surpass, Dec 9, 2022.

  1. Dream_Surpass

    Dream_Surpass

    Joined:
    Dec 2, 2022
    Posts:
    18
    I try to use custom PPO algorithm to train a continuous action env(3DBall or others) in python by ml-agents env, just like tutorial https://github.com/Unity-Technologi...op/colab/Colab_UnityEnvironment_2_Train.ipynb, which use custom Qlearning to train discrete action space env.

    At the beginning 3DBall env can run normally, but several seconds later it terminates.
    upload_2022-12-9_12-35-7.png

    I find decision_steps suddenly get nothing from last env step(it works normally at the beginning and the obs shape is [num_agents, obs_dim]), and I pass the obs to net to get the action(actually null), so compiler report an error above.
    upload_2022-12-9_12-27-16.png

    upload_2022-12-9_12-27-32.png

    Any ideas will be appreciated. Thanks!