You're working with a dataset where two features, 'age' and 'years of experience', have a high correlation. Which problem does this situation exemplify?
- Data leakage
- Multicollinearity
- Overfitting
- Underfitting
This situation exemplifies multicollinearity, a condition where two or more predictors in a multiple regression model are highly correlated. This high correlation means that 'age' and 'years of experience' provide similar information in predicting the dependent variable.
Loading...
Related Quiz
- The _________ function in Matplotlib is used to create a figure and a set of subplots.
- How would the mean change if an additional number far away from the current mean were added to the dataset?
- What is the primary advantage of using Plotly over Matplotlib?
- What are the key components to focus on during the 'communicate' step in EDA?
- When data is normally distributed, approximately 95% of the data falls within ________ standard deviations of the mean.