You are asked to apply Hierarchical Clustering to a dataset with mixed types of data (categorical and numerical). What challenges could arise and how would you tackle them?

  • All of the above
  • Computationally intensive clustering
  • Difficulty in defining a suitable distance metric
  • Inaccurate clustering due to the scale of numerical features
The primary challenge in clustering mixed types of data is defining a suitable distance metric that can handle both categorical and numerical features. You may need to standardize numerical features and find appropriate ways to measure distances for categorical attributes (e.g., using Gower distance). This choice will significantly influence the quality and interpretability of the clustering.
Add your answer
Loading...

Leave a comment

Your email address will not be published. Required fields are marked *