In DQNs, the target Q-values are updated less frequently than the predicted Q-values to ensure stability, using a concept known as ________ networks.

  • Target
  • Control
  • Reactive
  • Neural
In Deep Q Networks (DQNs), the target Q-values are updated less frequently to stabilize learning. This concept is known as 'target' networks, which involves having separate target Q-networks.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *