Data with consistent types (e.g., numeric, character) and structures (e.g., tidy tables).
The book by Mark van der Loo and Edwin de Jonge redefines data cleaning from a tedious chore into a rigorous, automated statistical discipline. It provides a systematic framework for transforming "raw" data into "valid" data ready for analysis, primarily using the R programming language. The Statistical Value Chain Statistical Data Cleaning with Applications in R
Data that has been checked against domain-specific rules and logical restrictions. Key Methodology and R Applications Data with consistent types (e
The authors emphasize that data cleaning is not just about removing errors but about identifying them through . Statistical Data Cleaning with Applications in R Data with consistent types (e.g.