Question How to train PPO algorithm in MLagents using Python instead of CLI?

hlwang01 · May 8, 2023

I am seeking guidance on how to resolve the issue, or receive instructions on how to call the PPO algorithm using Python. As I need to configure several parameters in Python before executing PPO, the CLI training method is not suitable for my needs

I tried the following code, but it threw an error. I don't know if it's because the feature is not supported yet or if my code is incorrect. The error message is as follow

Code (Python):

from mlagents_envs.environment import UnityEnvironment

from mlagents.trainers.ppo.trainer import PPOTrainer

from mlagents.trainers.settings import TrainerSettings

trainer_settings = TrainerSettings()

env = UnityEnvironment("RollerAgent/RollerAgent.exe",no_graphics=True)

behavior_names = list(env.behavior_specs.keys())

print(behavior_names)

ppotrainer = PPOTrainer("RollerBall", 10, trainer_settings, True, False, 0, "")

env.reset()

for _ in range(1000):

decision_steps, terminal_steps = env.get_steps(behavior_names[0])

ppotrainer.advance()

error message
line 622, in _set_default_hyperparameters
return all_trainer_settings[self.trainer_type]()
KeyError: 'ppo'

Search Unity

Unity ID

Useful Searches

Question How to train PPO algorithm in MLagents using Python instead of CLI?

hlwang01