Search Unity

Question Question about bc+gail+extrinsic reward

Discussion in 'ML-Agents' started by KevinWoozy1423, Apr 12, 2023.

?

Question about bc+gail+extrinsic reward

  1. mlagents

    1 vote(s)
    100.0%
  2. behaviral cloning

    1 vote(s)
    100.0%
Multiple votes are allowed.
  1. KevinWoozy1423

    KevinWoozy1423

    Joined:
    Aug 9, 2021
    Posts:
    3
    Hi, I use the BC only to train my agent in 150,000 steps, I get the fairly good results. And then I use gail + extrinsic reward and continue training. The mean reward immediately goes down to negative and the agent hit the obstacle repeatly. Why did this happen? It seems that the agent forget everything it learns with BC and start learning from the beginning. Or perhaps my demonstrations is not good enough? My game is crane path planning in 3d space, the goal is to find the target.Here is my config file. upload_2023-4-12_21-9-17.png
    upload_2023-4-12_21-10-34.png
     

    Attached Files:

  2. KevinWoozy1423

    KevinWoozy1423

    Joined:
    Aug 9, 2021
    Posts:
    3
  3. y116114

    y116114

    Joined:
    Nov 23, 2020
    Posts:
    3
    Hello! I haven't had this problem. Can i see your C# code about "crane path planning"? My email is 1164072013@qq.com