In data cleansing, identifying and handling duplicate records is referred to as ________.
- Aggregation
- Deduplication
- Normalization
- Segmentation
Deduplication is the process of identifying and removing duplicate records or entries from a dataset. Duplicates can arise from data entry errors, system glitches, or merging data from multiple sources, and they introduce inaccuracies and redundancy. Detecting and eliminating them improves data quality, reduces storage costs, and makes downstream analysis and decision-making more reliable.
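To make the idea concrete, here is a minimal sketch of exact-match deduplication using pandas; the column names and records are hypothetical, and real-world pipelines often also need fuzzy matching to catch near-duplicates (e.g. "Jon Smith" vs. "John Smith"):

```python
import pandas as pd

# Illustrative dataset: column names and values are hypothetical.
records = pd.DataFrame({
    "customer_id": [101, 102, 101, 103, 102],
    "email": ["ana@example.com", "bo@example.com", "ana@example.com",
              "cy@example.com", "bo@example.com"],
    "signup_date": ["2024-01-05", "2024-02-11", "2024-01-05",
                    "2024-03-02", "2024-02-11"],
})

# Identify duplicates: rows whose key columns match an earlier row.
is_dup = records.duplicated(subset=["customer_id", "email"], keep="first")
print(f"Duplicate rows found: {is_dup.sum()}")

# Remove them, keeping the first occurrence of each record.
deduped = records.drop_duplicates(subset=["customer_id", "email"],
                                  keep="first")
print(deduped)
```

Here `keep="first"` retains the earliest occurrence of each record; depending on the cleansing policy, keeping the most recent row (`keep="last"`) or merging conflicting fields may be more appropriate.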