A robot is navigating a maze. Initially, it often runs into walls or dead-ends, but over time it starts finding the exit more frequently. To achieve this, the robot likely emphasized ________ in the beginning and shifted towards ________ over time.
- Exploration, Exploitation
- Breadth-First Search
- Depth-First Search
- A* Search
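In reinforcement learning, the robot initially favors "exploration": it tries unfamiliar actions to learn the maze's layout, even though many of them lead into walls or dead-ends. As its estimates of which actions pay off improve, it shifts towards "exploitation", choosing the actions it has learned yield the highest reward, such as the path to the exit. The other options are search algorithms; they do not describe behavior that changes through learning.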
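One common way to implement this shift is an epsilon-greedy rule with a decaying exploration rate. The sketch below is illustrative only: the question does not say which method the robot uses, and the function names (`epsilon_at`, `choose_action`) and the schedule parameters (`start`, `end`, `decay`) are assumptions chosen for the example.

```python
import random


def epsilon_at(episode, start=1.0, end=0.05, decay=0.99):
    """Exploration rate for an episode: decays geometrically from start, floored at end.

    The default values are hypothetical, not taken from the question.
    """
    return max(end, start * decay ** episode)


def choose_action(q_values, epsilon, rng=random):
    """Epsilon-greedy choice over a list of estimated action values."""
    if rng.random() < epsilon:
        # Explore: pick any action uniformly at random.
        return rng.randrange(len(q_values))
    # Exploit: pick the action with the highest estimated value.
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

Early on, `epsilon_at` is close to 1, so nearly every move is random (exploration). After many episodes it settles at the floor value, so the robot almost always takes the best-known move (exploitation) while still occasionally trying something else.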
Related Quiz
- In which learning approach does the model learn to make decisions by receiving rewards or penalties for its actions?
- In the Actor-Critic model, what role does the Critic's feedback play in adjusting the Actor's policies?
- Machine learning algorithms trained on medical images to detect anomalies are commonly referred to as ________.
- While LSTMs have three gates, the GRU simplifies the model by using only ________ gates.
- Techniques like backward elimination, forward selection, and recursive feature elimination are used for ________ in machine learning.