Suppose your machine learning model shows a significant shift in performance when transitioning from the training set to the test set. How could mishandling missing data contribute to this issue?

  • It may have caused an imbalance in the data distribution between the sets.
  • It may have caused overfitting.
  • It may have led to the model learning irrelevant patterns.
  • It may have led to underfitting.
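If missing data is not handled consistently between the training and test sets (for example, values are imputed in one set but rows are dropped in the other), the data distributions of the two sets can diverge. The model is then evaluated on data that looks different from what it was trained on, which causes its performance to shift.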
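One way to keep the handling consistent can be sketched as follows: learn the fill values from the training set only, then reuse those same values on the test set. This is a minimal illustration, not a definitive implementation. The helper names `fit_mean_imputer` and `apply_imputer`, the choice of mean imputation, and the toy data are all assumptions made for this example.

```python
from statistics import mean

def fit_mean_imputer(rows):
    """Learn per-column means from the given rows, ignoring None (missing).

    Hypothetical helper for illustration. Raises statistics.StatisticsError
    if a column has no observed values at all.
    """
    n_cols = len(rows[0])
    return [
        mean(r[j] for r in rows if r[j] is not None)
        for j in range(n_cols)
    ]

def apply_imputer(rows, means):
    """Replace each None with the previously learned mean for its column."""
    return [
        [means[j] if value is None else value for j, value in enumerate(row)]
        for row in rows
    ]

# Toy data (assumed): None marks a missing value.
train = [[1.0, 10.0], [None, 20.0], [3.0, None]]
test = [[None, 40.0], [5.0, None]]

# Fit on the training set only...
train_means = fit_mean_imputer(train)

# ...then apply the same fill values to both sets.
train_filled = apply_imputer(train, train_means)
test_filled = apply_imputer(test, train_means)
```

Because both sets are filled with the same training-set statistics, the preprocessing itself does not introduce a distribution difference between them, and no information from the test set leaks into training.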