You are analyzing a dataset where the variable 'income' has a skewed distribution due to a few high-income individuals. What method would you recommend to handle these outliers?
- Binning
- Removal
- Transformation
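In this case, a transformation such as the log transformation is the best fit. It compresses the few very high incomes toward the rest of the distribution, which reduces the right skew while keeping every observation in the dataset. Removal would discard legitimate high-income individuals, and binning would lose the detail within each bin.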
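As a rough illustration of the effect, the sketch below applies a log transformation to simulated income values and compares the skewness before and after. The data are hypothetical (drawn from a lognormal distribution with arbitrarily chosen parameters), and the `skewness` helper is written here for the example rather than taken from any library.

```python
import math
import random


def skewness(values):
    """Skewness as the third standardized moment (population form)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical income data: lognormal draws produce a long right tail,
# i.e. a few individuals with very high incomes.
rng = random.Random(0)
income = [rng.lognormvariate(10.5, 0.8) for _ in range(5000)]

# log1p computes log(1 + x), so it also tolerates incomes of zero.
log_income = [math.log1p(v) for v in income]

skew_before = skewness(income)
skew_after = skewness(log_income)

print(f"skewness before transformation: {skew_before:.2f}")
print(f"skewness after transformation:  {skew_after:.2f}")
```

On data like this, the skewness of the transformed values should sit much closer to zero than that of the raw values. Real income data may not be lognormal, so it is worth checking the transformed distribution rather than assuming the skew is gone.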