In reinforcement learning, ________ focuses on trying new actions, while ________ focuses on leveraging known rewards.
- Exploration Policy
- Exploitation Policy
- Random Policy
- Deterministic Policy
In reinforcement learning, exploration policy focuses on trying new actions to learn more about the environment. Exploitation policy, on the other hand, leverages known rewards to make optimal decisions based on what's already learned.
Loading...
Related Quiz
- When an agent overly focuses on actions that have previously yielded rewards without exploring new possibilities, it might fall into a ________ trap.
- An autoencoder's primary objective is to minimize the difference between the input and the ________.
- A bank uses a machine learning model for loan approvals. However, it's observed that individuals from certain ethnic backgrounds are consistently getting rejected more than others, despite having similar financial profiles. This raises concerns related to which aspect of machine learning?
- Consider a scenario where a drone is learning to navigate through a maze. Which reinforcement learning algorithm can be utilized to train the drone?
- If you want to visualize high-dimensional data in a 2D or 3D space, which of the following techniques would be suitable?