In reinforcement learning, ________ focuses on trying new actions, while ________ focuses on leveraging known rewards.

  • Exploration Policy
  • Exploitation Policy
  • Random Policy
  • Deterministic Policy
Answer: Exploration Policy, then Exploitation Policy. In reinforcement learning, an exploration policy tries new actions to learn more about the environment, even when their rewards are still uncertain. An exploitation policy, by contrast, chooses the actions with the highest known rewards based on what the agent has already learned.
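One common way to combine the two behaviors is an epsilon-greedy rule, which is not named in the question itself but illustrates the trade-off. The sketch below is a minimal illustration under stated assumptions: the function names `epsilon_greedy_action` and `run_bandit`, the parameter `epsilon`, and the Gaussian reward model are all hypothetical choices made for this example.

```python
import random


def epsilon_greedy_action(q_values, epsilon, rng):
    """Return an action index: explore with probability epsilon, else exploit."""
    if rng.random() < epsilon:
        # Exploration: try a uniformly random action, regardless of estimates.
        return rng.randrange(len(q_values))
    # Exploitation: pick the action with the highest estimated reward so far.
    return max(range(len(q_values)), key=lambda a: q_values[a])


def run_bandit(true_means, epsilon=0.1, steps=2000, seed=0):
    """Simulate a multi-armed bandit (assumed Gaussian rewards, unit variance)."""
    rng = random.Random(seed)
    q_values = [0.0] * len(true_means)  # running estimate of each action's reward
    counts = [0] * len(true_means)      # how often each action was chosen
    for _ in range(steps):
        action = epsilon_greedy_action(q_values, epsilon, rng)
        reward = rng.gauss(true_means[action], 1.0)
        counts[action] += 1
        # Incremental mean update of the estimate for the chosen action.
        q_values[action] += (reward - q_values[action]) / counts[action]
    return q_values, counts


if __name__ == "__main__":
    estimates, pulls = run_bandit([0.2, 0.5, 0.8])
    print("estimated rewards:", estimates)
    print("times each action was chosen:", pulls)
```

With `epsilon=0` the agent only exploits and may never discover a better action. With `epsilon=1` it only explores and never uses what it has learned. Values in between mix the two.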