Dirty data

https://terms.codata.org/rdmt/dirty-data

alt label
Dirty dataset
definition Data that contain errors. Can be caused by a number of factors including: inaccurate, incomplete or erroneous data such as spelling or punctuation errors, incorrect data or incorrect data type associated with a field, incomplete or outdated data, duplicate data, inconsistent data, incorrectly ordered data, improper parsing of fields from disparate systems, etc. Using a dirty dataset can lead to spurious associations, false conclusions and misdirected investments.
editorial note Expert review decision, 2021-22: No review this cycle
type
Resource original
Concept original
in scheme
https://terms.codata.org/rdmt original
has top concept
dirty-data original
top concept of rdmt original