The balance between taking actions known to be rewarding and discovering new ones is termed the ________ dilemma.
- Exploration-Exploitation
- Action-Selection
- Reinforcement-Reward
- Decision-Making
The answer is the Exploration-Exploitation dilemma, a central problem in Reinforcement Learning: an agent must balance exploiting actions already known to yield rewards against exploring new actions that may turn out to be more rewarding.
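One common way to illustrate this trade-off is an epsilon-greedy strategy on a multi-armed bandit. The sketch below is only an illustration under stated assumptions: the function name `epsilon_greedy`, the parameters `true_means`, `epsilon`, `steps`, `noise`, and `seed`, and the Gaussian reward model are all hypothetical choices made for this example, not part of the quiz.

```python
import random


def epsilon_greedy(true_means, epsilon=0.1, steps=5000, noise=1.0, seed=0):
    """Run a simple epsilon-greedy agent on a bandit with Gaussian rewards.

    true_means, noise and seed are illustrative assumptions for this sketch.
    Returns (counts, estimates): pulls per arm and estimated mean reward per arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how often each arm was pulled
    estimates = [0.0] * n_arms   # running average reward per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate
            # (ties go to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = rng.gauss(true_means[arm], noise)
        counts[arm] += 1
        # Incremental update of the running average.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return counts, estimates


if __name__ == "__main__":
    means = [0.1, 0.5, 0.9]
    print("pure exploitation:", epsilon_greedy(means, epsilon=0.0, noise=0.0)[0])
    print("epsilon = 0.1:    ", epsilon_greedy(means, epsilon=0.1, noise=0.0)[0])
```

With `epsilon=0.0` and noise-free rewards, the agent in this sketch pulls the first arm once, sees a positive reward, and never tries the others, even though they pay more. A small nonzero `epsilon` lets it occasionally sample the other arms and discover the better one.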
Related Quiz
- What is the primary objective of Generative Adversarial Networks (GANs)?
- One of the applications of NLP in healthcare is to assist in ________, which involves the conversion of voice-recorded notes into text format.
- When considering a confusion matrix, which metric calculates the harmonic mean of precision and recall?
- A company wants to determine the best version of their website homepage among five different designs. They decide to show each version to a subset of visitors and observe which version results in the highest user engagement. This problem is analogous to which classical problem in reinforcement learning?
- A robot is navigating a maze. Initially, it often runs into walls or dead-ends, but over time it starts finding the exit more frequently. To achieve this, the robot likely emphasized ________ in the beginning and shifted towards ________ over time.