When handling outliers in a dataset with skewed distributions, which measure of central tendency is preferred for imputation?
- Mean
- Median
- Mode
- Geometric Mean
When dealing with skewed datasets, the median is preferred for imputation. The median is robust to extreme values and is less affected by outliers than the mean. Using the median as the measure of central tendency helps maintain the integrity of the dataset in the presence of outliers.
Loading...
Related Quiz
- A research team is analyzing a large dataset with multiple features. They want to identify clusters or groups in the data. What visualization technique can help them visualize high-dimensional data in a 2D or 3D space?
- In Matplotlib, the foundation for all visualizations is the _______ object, which provides the canvas where plots are drawn.
- What is the primary objective of feature scaling in a dataset?
- Which Python library is specifically designed for statistical data visualization and is built on top of Matplotlib?
- You are responsible for ensuring that the data in your company's data warehouse is consistent, reliable, and easily accessible. Recently, there have been complaints about data discrepancies. Which stage in the ETL process should you primarily focus on to resolve these issues?