A medical research company is working on image data, where they want to classify microscopic images into cancerous and non-cancerous categories. The boundary between these categories is not linear. Which algorithm would be a strong candidate for this problem?
- Convolutional Neural Network (CNN)
- Logistic Regression
- Naive Bayes Classifier
- Principal Component Analysis
Convolutional Neural Networks (CNNs) are excellent for image classification tasks, especially when dealing with non-linear boundaries. They use convolutional layers to extract features from images, making them suitable for tasks like cancerous/non-cancerous image classification.
Which regression method assumes a linear relationship between the independent and dependent variables?
- Decision Tree Regression
- Logistic Regression
- Polynomial Regression
- Ridge Regression
Polynomial Regression assumes a linear relationship between the independent and dependent variables. It models relationships as polynomial functions, and other regression methods may assume different relationships.
In the context of reinforcement learning, what does the term "gradient" in "Policy Gradient Methods" refer to?
- The direction of steepest ascent in the policy space
- A mathematical term used to describe the rate of change
- The probability distribution of actions
- The value function
In "Policy Gradient Methods," the "gradient" refers to the direction in the policy space that increases the expected reward. It guides policy updates to maximize reward.
For a non-linearly separable dataset, which property of SVMs allows them to classify the data?
- Feature selection
- Kernel functions
- Large training dataset
- Parallel processing
SVMs can classify non-linearly separable data using kernel functions, which map the data into a higher-dimensional space where it becomes linearly separable.
For text classification problems, the ________ variant of Naive Bayes is often used.
- K-Means
- Multinomial
- Random Forest
- SVM
In text classification, the Multinomial variant of Naive Bayes is commonly used due to its suitability for modeling discrete data like word counts.
The Actor-Critic model combines value-based and ________ methods to optimize its decision-making process.
- Policy-Based
- Model-Free
- Model-Based
- Q-Learning
The Actor-Critic model combines value-based (critic) and model-free (actor) methods to optimize decision-making. The critic evaluates actions using value functions, and the actor selects actions based on this evaluation, thus combining two approaches for improved learning.
Which regression technique is primarily used for predicting a continuous outcome variable (like house price)?
- Decision Tree Regression
- Linear Regression
- Logistic Regression
- Polynomial Regression
Linear Regression is the most common technique for predicting a continuous outcome variable, such as house prices. It establishes a linear relationship between input features and the output.
Variational autoencoders (VAEs) introduce a probabilistic spin to autoencoders by associating a ________ with the encoded representations.
- Probability Distribution
- Singular Value Decomposition
- Principal Component
- Regression Function
VAEs introduce a probabilistic element to autoencoders by associating a probability distribution (typically Gaussian) with the encoded representations. This allows for generating new data points.
Which ensemble method combines multiple decision trees and aggregates their results for improved accuracy and reduced overfitting?
- Logistic Regression
- Naive Bayes
- Principal Component Analysis (PCA)
- Random Forest
Random Forest is an ensemble method that combines multiple decision trees. It aggregates their results through techniques like bagging and boosting to achieve better accuracy and reduce overfitting. Random Forest is a popular choice for various machine learning tasks.
In the context of text classification, Naive Bayes often works well because it can handle what type of data?
- High-Dimensional and Sparse Data
- Images and Videos
- Low-Dimensional and Dense Data
- Numeric Data
Naive Bayes is effective with high-dimensional and sparse data as it assumes independence between features, making it suitable for text data with numerous attributes.
In ________ learning, the algorithm isn't provided with the correct answers but discovers them through exploration and exploitation.
- Reinforcement
- Semi-supervised
- Supervised
- Unsupervised
Reinforcement learning involves exploration and exploitation strategies, where the algorithm learns by trial and error and discovers correct answers over time. It doesn't start with pre-defined correct answers.
Which algorithm is a popular choice for solving the multi-armed bandit problem when the number of arms is large and some structure can be assumed on the rewards?
- Epsilon-Greedy
- UCB1
- Thompson Sampling
- Greedy
UCB1 (Upper Confidence Bound 1) is a popular choice for the multi-armed bandit problem when you can assume some structure on the rewards and the number of arms is large. UCB1 balances exploration and exploitation effectively by using confidence bounds to select arms.