When an agent overly focuses on actions that have previously yielded rewards without exploring new possibilities, it might fall into a ________ trap.
- Exploitation
- Exploration
- Learning
- Reward
If an agent concentrates too heavily on actions that have yielded rewards in the past, it falls into an exploitation trap: it keeps repeating known-good actions and neglects the exploration needed to discover potentially better ones.
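The exploitation trap can be illustrated with a toy two-armed bandit. The sketch below is an illustration only, not part of the original quiz: the function name `run_bandit`, the fixed payouts, and the epsilon-greedy rule are all assumptions chosen for clarity. With `epsilon=0.0` the agent is purely greedy and locks onto the first arm that pays anything; with a small `epsilon` it occasionally tries the other arm and finds the better payout.

```python
import random


def run_bandit(epsilon, steps=1000, seed=0):
    """Run an epsilon-greedy agent on a toy two-armed bandit.

    Returns how many times each arm was pulled.
    """
    rng = random.Random(seed)
    payouts = [1.0, 2.0]      # hypothetical fixed rewards; arm 1 is the better arm
    estimates = [0.0, 0.0]    # running average reward observed per arm
    counts = [0, 0]           # number of pulls per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(len(payouts))
        else:
            # Exploit: pick the arm with the highest estimate
            # (ties resolve to the lowest index).
            arm = max(range(len(payouts)), key=lambda a: estimates[a])

        reward = payouts[arm]
        counts[arm] += 1
        # Incremental update of the running average for the chosen arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return counts


greedy_counts = run_bandit(epsilon=0.0)      # never explores
exploring_counts = run_bandit(epsilon=0.1)   # explores 10% of the time

print("greedy pulls per arm:   ", greedy_counts)
print("exploring pulls per arm:", exploring_counts)
```

In this setup the greedy agent pulls arm 0 on every step, because its first reward makes arm 0 look better than the untried arm 1. That is the exploitation trap in miniature. The exploring agent spends most of its pulls on arm 1 once it has sampled it.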