Monash University
Browse

Expanding tidy data principles to facilitate missing data exploration, visualization and assessment of imputations

Download (4.42 MB)
journal contribution
posted on 2022-11-09, 04:40 authored by Nicholas Tierney, Dianne Cook
Despite the large body of research on missing value distributions and imputation, there is comparatively little literature on how to make it easy to handle, explore, and impute missing values in data. This paper addresses this gap. The new methodology builds upon tidy data principles, with a goal to integrating missing value handling as an integral part of data analysis workflows. New data structures are defined along with new functions (verbs) to perform common operations. Together these provide a cohesive framework for handling, exploring, and imputing missing values. These methods have been made available in the R package naniar.

History

Classification-JEL

C10, C14, C22

Creation date

2018-09-07

Working Paper Series Number

14/18

Length

41

File-Format

application/pdf

Handle

RePEc:msh:ebswps:2018-14

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC