Data cleaning

https://terms.codata.org/rdmt/data-cleaning

alt label
Data cleansing
Data scrubbing
definition Process of detecting and correcting corrupt or inaccurate records from a dataset. Data cleaning is a continuous process that requires corrective actions throughout the data lifecycle. Data cleaning involves identifying, replacing, modifying or deleting incomplete, incorrect, inaccurate, inconsistent, irrelevant, and improperly formatted data. Typically, the process involves updating, correcting, standardising, and de-duplicating records to create a single view of the data, even if they are stored in multiple disparate systems.
editorial note Expert review decision, 2021-22: Edit
type
Resource original
Concept original
in scheme
https://terms.codata.org/rdmt original
has top concept
data-cleaning original
top concept of rdmt original