What is the purpose of 'normalization' or 'standardization' in the pre-processing step of cluster analysis?
- To decrease the number of clusters
- To ensure that all features contribute equally to the distance calculation
- To handle missing values
- To increase the computational complexity
Normalization or standardization rescales the features to a comparable range so that each one contributes equally to the distance calculation, regardless of its original units or scale. Without this step, features with larger numeric ranges (for example, income in dollars next to age in years) would dominate the distance calculation, potentially leading to misleading clusters.
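As a rough illustration of this effect, the following Python sketch compares Euclidean distances before and after z-score standardization. The data set (age in years, annual income in dollars) and the helper names `euclidean` and `standardize` are hypothetical, made up only for this example; the sketch assumes the population standard deviation and that no feature is constant.

```python
import math
from statistics import mean, pstdev

# Hypothetical data: (age in years, annual income in dollars).
points = {
    "A": (25, 50_000),
    "B": (27, 53_000),
    "C": (60, 51_000),
    "D": (40, 90_000),
    "E": (45, 20_000),
}


def euclidean(a, b):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def standardize(rows):
    """Z-score each column: subtract its mean, divide by its std.

    Assumes no column is constant (a zero std would divide by zero).
    """
    columns = list(zip(*rows))
    means = [mean(col) for col in columns]
    stds = [pstdev(col) for col in columns]
    return [
        tuple((value - m) / s for value, m, s in zip(row, means, stds))
        for row in rows
    ]


names = list(points)
raw = points
scaled = dict(zip(names, standardize(list(points.values()))))

# Raw data: the income column has a far larger numeric range, so it
# dominates the distance and A appears closer to C than to B,
# despite the 35-year age gap between A and C.
raw_ab = euclidean(raw["A"], raw["B"])
raw_ac = euclidean(raw["A"], raw["C"])

# Standardized data: both features are on a comparable scale, and A is
# now closer to B, which is similar in both age and income.
scaled_ab = euclidean(scaled["A"], scaled["B"])
scaled_ac = euclidean(scaled["A"], scaled["C"])

print(f"raw:    d(A,B)={raw_ab:.1f}  d(A,C)={raw_ac:.1f}")
print(f"scaled: d(A,B)={scaled_ab:.3f}  d(A,C)={scaled_ac:.3f}")
```

In practice a library scaler would typically be applied before a distance-based clustering algorithm such as k-means; the hand-written version here is only meant to make the arithmetic visible.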