In reinforcement learning, the agent learns a policy which maps states to ________.
- Actions
- Rewards
- Values
- Policies
In reinforcement learning, the agent learns a policy that maps states to optimal actions, hence filling in the blank with "Policies" is accurate. This policy helps the agent make decisions in various states.
Loading...
Related Quiz
- The multi-armed bandit problem is a classic problem in which domain?
- In which algorithm is the outcome determined based on a majority vote from its neighbors?
- What type of neural network is designed for encoding input data into a compressed representation and then decoding it back to its original form?
- Time series forecasting is crucial in fields like finance and meteorology because it helps in predicting stock prices and ________ respectively.
- Which technique involves setting a fraction of input units to 0 at each update during training time, which helps to prevent overfitting?