In ________ learning, the algorithm isn't provided with the correct answers but discovers them through exploration and exploitation.
- Reinforcement
- Semi-supervised
- Supervised
- Unsupervised
Reinforcement learning involves exploration and exploitation strategies, where the algorithm learns by trial and error and discovers correct answers over time. It doesn't start with pre-defined correct answers.
Loading...
Related Quiz
- In the context of reinforcement learning, what does the term "gradient" in "Policy Gradient Methods" refer to?
- A common activation function used in CNNs that helps introduce non-linearity is ________.
- The balance between taking known rewarding actions and discovering new ones is termed as the ________ dilemma.
- In K-means clustering, the algorithm iteratively updates the cluster centers until the within-cluster sum of squares is ________.
- In the context of regularization, what is the primary difference between L1 and L2 regularization?