The hardware and bandwidth for this mirror is donated by METANET, the Webhosting and Full Service-Cloud Provider.
If you wish to report a bug, or if you are interested in having us mirror your free-software or open-source project, please feel free to contact us at mirror[@]metanet.ch.
Missing data is defined as “data that we intended to collect, but for one reason or another were unable to” [1].
Missing data represents a loss of information, and so reduces the statistical power of a study [2].
Different methods for handling missing data (e.g. mean imputation)
can introduce bias when estimating statistics [3]. Therefore, to produce
unbiased estimates, one must choose the correct method. In order to do
this, one must understand and categorise the pattern of missingness in
the dataset. Having categorised the missingness as MCAR, MAR, or MNAR
(see vignette("background")
) one can then choose the
correct method for handling the missing data without introducing
bias.
This package provides three functions — mcar()
,
mar()
, and mnar()
— to help you categorise the
missingness in a dataset.
This package also provides several toy datasets:
animalhealth
pollutionlevels
healthcheck
companydata
testscores
There is a lack of information and software on practically categorising missing data.
[1] Carpenter JR, Kenward MG. Missing Data in Randomised Controlled Trials: A Practical Guide. Health Technology Assessment Methodology Programme; 2007.
[2] Pham TM, Pandis N, White IR. Missing data: Issues, concepts, methods. Seminars in Orthodontics. 2024;30(1):37-44. Statis- tics every orthodontist should know.
[3] van Buuren S. Flexible Imputation of Missing Data, Second Edition. Chapman and Hall/CRC; 2018.
These binaries (installable software) and packages are in development.
They may not be fully stable and should be used with caution. We make no claims about them.