In the Actor-Critic approach, the ________ provides a gradient for policy improvement based on feedback.
- Critic
- Agent
- Selector
- Actor
In the Actor-Critic approach, the Critic evaluates the policy and provides a gradient that guides policy improvement based on feedback, making it a fundamental element of the approach.
Loading...
Related Quiz
- In which learning approach does the model learn to...
- When dealing with high-dimensional data, which of the two algorithms (k-NN or Naive Bayes) is likely to be more efficient in terms of computational time?
- Which term refers to using a model that has already been trained on a large dataset and fine-tuning it for a specific task?
- The value at which the sigmoid function outputs a 0.5 probability, thereby determining the decision boundary in logistic regression, is known as the ________.
- For binary classification tasks, which regression outputs a probability score between 0 and 1?