How would you handle missing data for a numerical feature in a dataset before training a machine learning model?

  • Ignore missing data, it won't affect the model
  • Remove the rows with missing data
  • Replace missing values with a random value
  • Replace missing values with the mean of the feature
Handling missing data is crucial. Replacing missing values with the mean of the feature is a common practice as it retains data and doesn't introduce bias, especially in numerical features. Removing rows or using random values can lead to loss of information or noise.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *