While linear regression is concerned with estimating the mean of the dependent variable, logistic regression estimates the probability that the dependent variable belongs to a particular ________.

  • Category
  • Class
  • Cluster
  • Group
Logistic regression estimates the probability that the dependent variable belongs to a particular class or category. Unlike linear regression, which predicts continuous values, logistic regression is used for classification problems.

Which type of learning is typically employed when there's neither complete supervision nor complete absence of supervision, but a mix where an agent learns to act in an environment?

  • Reinforcement Learning
  • Self-supervised Learning
  • Semi-supervised Learning
  • Unsupervised Learning
Semi-supervised Learning fits this scenario. It combines labeled and unlabeled data to train a model. In situations where you have some labeled data but not enough for full supervision, or when labeling is expensive, semi-supervised learning is a practical choice.

Why is Independent Component Analysis (ICA) primarily used in applications like audio signal processing?

  • It's more computationally efficient
  • It separates mixed sources effectively
  • It requires less data for training
  • It's based on supervised learning
ICA is used in audio signal processing because it can effectively separate mixed sources, making it useful for source separation and blind signal separation tasks.

A healthcare company wants to classify patients into risk categories based on their medical history. They have a vast amount of patient data, but the relationships between variables are complex and non-linear. Which algorithm might be more suitable for this task?

  • Decision Trees
  • K-Nearest Neighbors (K-NN)
  • Logistic Regression
  • Naive Bayes
Decision Trees are suitable for complex and non-linear relationships between variables. They can capture intricate patterns in patient data, making them effective for risk categorization in healthcare.

In pharmacology, machine learning can aid in the process of drug discovery by predicting potential ________ of new compounds.

  • Toxicity
  • Flavor Profile
  • Market Demand
  • Molecular Structure
Machine learning can predict potential toxicity of new compounds by analyzing their chemical properties and interactions in pharmacology.

When dealing with high-dimensional data, which of the two algorithms (k-NN or Naive Bayes) is likely to be more efficient in terms of computational time?

  • Both Equally Efficient
  • Naive Bayes
  • Neither is Efficient
  • k-NN
Naive Bayes is typically more efficient in high-dimensional data due to its simple probabilistic calculations, while k-NN can suffer from the "curse of dimensionality."

In the k-NN algorithm, as the value of k increases, the decision boundary becomes __________.

  • Linear
  • More complex
  • More simplified
  • Non-existent
As the value of k in k-NN increases, the decision boundary becomes more simplified because it is based on fewer neighboring data points.

A company wants to segment its customers based on their purchasing behavior. They have a fair idea that there are around 5 distinct segments but want to confirm this. Which clustering algorithm might they start with?

  • K-Means Clustering
  • Agglomerative Hierarchical Clustering
  • Mean-Shift Clustering
  • Spectral Clustering
The company might start with K-Means Clustering to confirm their idea of five distinct segments. K-Means is often used for partitioning data into a pre-specified number of clusters and can be a good choice when you have a rough idea of the number of clusters.

Variational autoencoders (VAEs) introduce a probabilistic spin to autoencoders by associating a ________ with the encoded representations.

  • Probability Distribution
  • Singular Value Decomposition
  • Principal Component
  • Regression Function
VAEs introduce a probabilistic element to autoencoders by associating a probability distribution (typically Gaussian) with the encoded representations. This allows for generating new data points.

Which regression technique is primarily used for predicting a continuous outcome variable (like house price)?

  • Decision Tree Regression
  • Linear Regression
  • Logistic Regression
  • Polynomial Regression
Linear Regression is the most common technique for predicting a continuous outcome variable, such as house prices. It establishes a linear relationship between input features and the output.