Which of the following techniques is used to estimate future rewards in reinforcement learning?

  • Q-Learning
  • Gradient Descent
  • Principal Component Analysis
  • K-Means Clustering
Q-Learning is a technique in reinforcement learning used to estimate future rewards associated with taking actions in different states.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *