In a dataset with a categorical variable missing for some rows, why might mode imputation not be the best strategy?

  • All of the above
  • It can introduce bias if the data is not missing at random
  • It could distort the original data distribution
  • It may not capture the underlying data pattern
Mode imputation might not be the best strategy for a dataset with a categorical variable missing for some rows. Although it's simple to implement, it may fail to capture the underlying data pattern, introduce bias if the data is not missing at random, and distort the original data distribution by overrepresenting the mode.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *