The Naive Bayes classifier assumes that the presence or absence of a particular feature of a class is ________ of the presence or absence of any other feature.

  • Correlated
  • Dependent
  • Independent
  • Unrelated
Naive Bayes assumes that features are independent of each other. This simplifying assumption helps make the algorithm computationally tractable but might not hold in all real-world cases.

Regularization techniques help in preventing overfitting. Which of these is NOT a regularization technique: Batch Normalization, Dropout, Adam Optimizer, L1 Regularization?

  • Adam Optimizer
  • Batch Normalization
  • Dropout
  • L1 Regularization
Adam Optimizer is not a regularization technique. It's an optimization algorithm used in training neural networks, while the others are regularization methods.

A medical research team is studying the relationship between various health metrics (like blood pressure, cholesterol level) and the likelihood of developing a certain disease. The outcome is binary (disease: yes/no). Which regression model should they employ?

  • Decision Tree Regression
  • Linear Regression
  • Logistic Regression
  • Polynomial Regression
Logistic Regression is the appropriate choice for binary outcomes, such as the likelihood of developing a disease (yes/no). It models the probability of a binary outcome based on predictor variables, making it well-suited for this medical research.

In the context of the multi-armed bandit problem, what is regret?

  • The feeling of loss and remorse
  • An optimization metric
  • A random variable
  • An arm selection policy
In the context of the multi-armed bandit problem, regret is an optimization metric that quantifies how much an agent's total reward falls short of the best possible reward it could have achieved by always choosing the best arm. It's a way to measure how well an agent's arm selection policy performs.

In which scenario is unsupervised learning least suitable: predicting house prices based on features, grouping customers into segments, or classifying emails as spam or not spam?

  • Classifying emails as spam or not spam
  • Grouping customers into segments
  • Predicting house prices based on features
  • Unsupervised learning is suitable for all scenarios
Unsupervised learning is least suitable for classifying emails as spam or not spam. This is because unsupervised learning doesn't have labeled data to distinguish between spam and non-spam emails. It is more applicable to clustering or grouping data when you don't have clear labels, such as grouping customers into segments.

In the k-NN algorithm, as the value of k increases, the decision boundary becomes __________.

  • Linear
  • More complex
  • More simplified
  • Non-existent
As the value of k in k-NN increases, the decision boundary becomes more simplified because it is based on fewer neighboring data points.

A company wants to segment its customers based on their purchasing behavior. They have a fair idea that there are around 5 distinct segments but want to confirm this. Which clustering algorithm might they start with?

  • K-Means Clustering
  • Agglomerative Hierarchical Clustering
  • Mean-Shift Clustering
  • Spectral Clustering
The company might start with K-Means Clustering to confirm their idea of five distinct segments. K-Means is often used for partitioning data into a pre-specified number of clusters and can be a good choice when you have a rough idea of the number of clusters.

Variational autoencoders (VAEs) introduce a probabilistic spin to autoencoders by associating a ________ with the encoded representations.

  • Probability Distribution
  • Singular Value Decomposition
  • Principal Component
  • Regression Function
VAEs introduce a probabilistic element to autoencoders by associating a probability distribution (typically Gaussian) with the encoded representations. This allows for generating new data points.

Which regression technique is primarily used for predicting a continuous outcome variable (like house price)?

  • Decision Tree Regression
  • Linear Regression
  • Logistic Regression
  • Polynomial Regression
Linear Regression is the most common technique for predicting a continuous outcome variable, such as house prices. It establishes a linear relationship between input features and the output.

The Actor-Critic model combines value-based and ________ methods to optimize its decision-making process.

  • Policy-Based
  • Model-Free
  • Model-Based
  • Q-Learning
The Actor-Critic model combines value-based (critic) and model-free (actor) methods to optimize decision-making. The critic evaluates actions using value functions, and the actor selects actions based on this evaluation, thus combining two approaches for improved learning.