You are given a dataset with a significant amount of outliers. Which scaling method would be most suitable and why?
- , outliers should always be removed
- Min-Max scaling because it scales all values between 0 and 1
- Robust scaling because it is not affected by outliers
- Z-score standardization because it reduces skewness
Robust scaling would be the most suitable as it uses the median and the interquartile range, which are not sensitive to outliers, for scaling the data. Other methods like Min-Max and Z-score are affected by the presence of outliers.
Loading...
Related Quiz
- The _________ function in Matplotlib is used to create a figure and a set of subplots.
- What are the pitfalls to avoid when trying to improve the readability of a graph?
- When features in a dataset are highly correlated, they might suffer from a problem known as ________, which can negatively impact the machine learning model.
- How is the shape of a Normal Distribution usually described?
- A team member has used a histogram to represent a dataset but the representation seems biased. What could be the potential issue?