Which of the following techniques is used to estimate future rewards in reinforcement learning?

Q-Learning
Gradient Descent
Principal Component Analysis
K-Means Clustering

Q-Learning is a technique in reinforcement learning used to estimate future rewards associated with taking actions in different states.

Add your answer

Facebook Twitter Linkedin Reddit Pinterest

Machine Learning Quiz

What is the main challenge faced by NLP systems when processing clinical notes in electronic health records?

When a model is trained on one task and the learned features are used as a starting point for a model on a second task, it's known as ________.

Related Quiz

Leave a commentCancel