When a model is trained on one task and the learned features are used as a starting point for a model on a second task, it's known as ________.

  • Transfer Learning
  • Data Augmentation
  • Ensemble Learning
  • Gradient Boosting
Transfer learning is a technique where knowledge gained from one task is applied as the starting point for another task. This helps leverage pre-trained models and speeds up learning on the new task.

Which application of machine learning in healthcare helps in predicting patient diseases based on their medical history?

  • Diagnostic Prediction
  • Medication Recommendation
  • Patient Scheduling
  • X-ray Image Analysis
Machine learning in healthcare is extensively used for Diagnostic Prediction, where algorithms analyze a patient's medical history to predict diseases.

A fashion company wants to create new designs based on current fashion trends. They decide to use machine learning to generate these designs. Which technology would be best suited for this purpose?

  • Convolutional Neural Networks (CNN)
  • Recurrent Neural Networks (RNN)
  • Support Vector Machines (SVM)
  • Decision Trees
Convolutional Neural Networks (CNN) are particularly well-suited for image-related tasks, making them ideal for generating fashion designs based on trends. CNNs can capture intricate patterns and styles within images, which is essential in the fashion industry.

In the context of regularization, what is the primary difference between L1 and L2 regularization?

  • L1 regularization adds the absolute values of coefficients as a penalty, leading to feature selection
  • L1 regularization adds the squared values of coefficients as a penalty, promoting sparsity
  • L2 regularization adds the absolute values of coefficients as a penalty, promoting sparsity
  • L2 regularization adds the squared values of coefficients as a penalty, leading to feature selection
L1 regularization, also known as Lasso, adds the absolute values of coefficients as a penalty, which promotes feature selection by driving some coefficients to zero. In contrast, L2 regularization, or Ridge, adds the squared values of coefficients as a penalty, which doesn't drive coefficients to zero and instead promotes a "shrinking" effect.

Which of the following techniques is used to estimate future rewards in reinforcement learning?

  • Q-Learning
  • Gradient Descent
  • Principal Component Analysis
  • K-Means Clustering
Q-Learning is a technique in reinforcement learning used to estimate future rewards associated with taking actions in different states.

What is the potential consequence of deploying a non-interpretable machine learning model in a critical sector, such as medical diagnosis?

  • Inability to explain decisions
  • Improved accuracy
  • Faster decision-making
  • Better generalization
Deploying a non-interpretable model can result in a lack of transparency, making it challenging to understand how and why the model makes specific medical diagnosis decisions. This lack of transparency can be risky in critical sectors.

The ability of SVMs to handle non-linear decision boundaries is achieved by transforming the input data into a higher-dimensional space using a ______.

  • Classifier
  • Dimensionality Reduction
  • Ensemble
  • Kernel
SVMs use a mathematical function called a kernel to transform data into a higher-dimensional space, enabling them to handle non-linear decision boundaries effectively.

The multi-armed bandit problem is a classic problem in which domain?

  • Sequential Decision-Making Problems
  • Natural Language Processing
  • Computer Graphics
  • Speech Recognition
The multi-armed bandit problem falls under the domain of Sequential Decision-Making Problems, specifically addressing scenarios where a decision must be made over time with limited resources.

In the context of RNNs, what problem does the introduction of gating mechanisms in LSTMs and GRUs aim to address?

  • Vanishing and Exploding Gradients
  • Overfitting and Data Loss
  • Dimensionality Reduction and Compression
  • Sequence Length Reduction and Truncation
The introduction of gating mechanisms in LSTMs and GRUs aims to address the problem of vanishing and exploding gradients, which occur during training due to the backpropagation of errors over long sequences. These mechanisms help RNNs capture long-range dependencies in data.

In K-means clustering, the algorithm iteratively updates the cluster centers until the within-cluster sum of squares is ________.

  • Minimized
  • Equal to 0
  • Maximized
  • Converged
In K-means clustering, the algorithm aims to minimize the within-cluster sum of squares (WCSS). This represents the total variance within clusters. As the algorithm iteratively updates the cluster centers, the goal is to minimize the WCSS, making "Minimized" the correct option.