Regularization techniques help in preventing overfitting. Which of these is NOT a regularization technique: Batch Normalization, Dropout, Adam Optimizer, L1 Regularization?

  • Adam Optimizer
  • Batch Normalization
  • Dropout
  • L1 Regularization
Adam Optimizer is not a regularization technique. It's an optimization algorithm used in training neural networks, while the others are regularization methods.

A medical research team is studying the relationship between various health metrics (like blood pressure, cholesterol level) and the likelihood of developing a certain disease. The outcome is binary (disease: yes/no). Which regression model should they employ?

  • Decision Tree Regression
  • Linear Regression
  • Logistic Regression
  • Polynomial Regression
Logistic Regression is the appropriate choice for binary outcomes, such as the likelihood of developing a disease (yes/no). It models the probability of a binary outcome based on predictor variables, making it well-suited for this medical research.

In the context of the multi-armed bandit problem, what is regret?

  • The feeling of loss and remorse
  • An optimization metric
  • A random variable
  • An arm selection policy
In the context of the multi-armed bandit problem, regret is an optimization metric that quantifies how much an agent's total reward falls short of the best possible reward it could have achieved by always choosing the best arm. It's a way to measure how well an agent's arm selection policy performs.

When NLP systems try to understand the context of words in medical documents to extract meaningful information, they are leveraging a technique called ________.

  • Named Entity Recognition
  • Sentiment Analysis
  • Document Summarization
  • Word Embeddings
Named Entity Recognition is a technique in NLP used to identify and classify entities in medical documents, such as drugs, diseases, or patient names.

In which scenario is unsupervised learning least suitable: predicting house prices based on features, grouping customers into segments, or classifying emails as spam or not spam?

  • Classifying emails as spam or not spam
  • Grouping customers into segments
  • Predicting house prices based on features
  • Unsupervised learning is suitable for all scenarios
Unsupervised learning is least suitable for classifying emails as spam or not spam. This is because unsupervised learning doesn't have labeled data to distinguish between spam and non-spam emails. It is more applicable to clustering or grouping data when you don't have clear labels, such as grouping customers into segments.

While supervised learning requires explicit labels, ________ learning operates on data without explicit instructions.

  • Deep
  • Reinforcement
  • Semi-supervised
  • Unsupervised
In machine learning, unsupervised learning operates on data without explicit labels or instructions. Supervised learning, on the other hand, relies heavily on labeled data.

CNNs are particularly effective for image data due to their ability to preserve the ________ structure of the data.

  • Spatial
  • Color
  • Temporal
  • Frequency
CNNs are effective for image data due to their ability to preserve the spatial structure of the data, which is crucial for detecting patterns in pixels' proximity.

While linear regression is concerned with estimating the mean of the dependent variable, logistic regression estimates the probability that the dependent variable belongs to a particular ________.

  • Category
  • Class
  • Cluster
  • Group
Logistic regression estimates the probability that the dependent variable belongs to a particular class or category. Unlike linear regression, which predicts continuous values, logistic regression is used for classification problems.

Which type of learning is typically employed when there's neither complete supervision nor complete absence of supervision, but a mix where an agent learns to act in an environment?

  • Reinforcement Learning
  • Self-supervised Learning
  • Semi-supervised Learning
  • Unsupervised Learning
Semi-supervised Learning fits this scenario. It combines labeled and unlabeled data to train a model. In situations where you have some labeled data but not enough for full supervision, or when labeling is expensive, semi-supervised learning is a practical choice.

Why is Independent Component Analysis (ICA) primarily used in applications like audio signal processing?

  • It's more computationally efficient
  • It separates mixed sources effectively
  • It requires less data for training
  • It's based on supervised learning
ICA is used in audio signal processing because it can effectively separate mixed sources, making it useful for source separation and blind signal separation tasks.