One of the challenges in DQN is that small updates to Q values can lead to significant changes in the policy, making the learning process highly ________.

  • Sensitive
  • Efficient
  • Predictable
  • Robust
The term 'sensitive' in this context refers to the fact that small changes in Q values can have a disproportionate impact on the policy, making it unstable and hard to control.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *