Search Unity

  1. Welcome to the Unity Forums! Please take the time to read our Code of Conduct to familiarize yourself with the forum rules and how to post constructively.
Dismiss Notice
Join us now in the Performance Profiling Dev Blitz Day 2023 - Q&A forum where you can connect with our teams behind the Memory and CPU Profilers and the Frame Debugger.

Question what does it mean when the policy loss graph oscillates rapidly?

Discussion in 'ML-Agents' started by SuperRaed, Oct 30, 2022.

  1. SuperRaed

    SuperRaed

    Joined:
    Aug 18, 2015
    Posts:
    3
    Hello guys,
    We have trained an agent to shoot at enemy's while avoiding friends, the behavior seems to be decent, but at moments lacks accuracy; it can take multiple shoots to get the target.
    The overall graphs seem ok, but one thing really stood out for me is how the policy loss graph seems to fluctuate with high rapidity.

    Does anyone know what might be going on?
    (Below are graphs)
     

    Attached Files:

    • CFL.png
      CFL.png
      File size:
      26.4 KB
      Views:
      35
    • CRH.png
      CRH.png
      File size:
      106.1 KB
      Views:
      34
    • PL.png
      PL.png
      File size:
      50.4 KB
      Views:
      36
    • VL.png
      VL.png
      File size:
      37.6 KB
      Views:
      35
    • CR.png
      CR.png
      File size:
      33.4 KB
      Views:
      32