Abstract

Imputation methods were developed to estimate missing data and thereby mitigate the problems caused by the loss of this information. This study assesses whether data variability influences the results obtained after applying an imputation method. Incomplete databases were generated by removing different amounts of data from complete real databases of tomato experiments conducted in a randomized block design with three replications and 12 treatments. The evaluated variables were fruit weight per plant, number of fruits per plant, and average fruit length and width, forming eight balanced databases. The distribution-free multiple imputation method was then applied to the incomplete databases to produce completed databases. The amount of missing data influenced the accuracy measures in this study. Imputation was inadequate when variability was high and more precise and accurate when variability was low. These results confirm the importance of assessing data variability before choosing to apply an imputation method.
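
The evaluation loop described in the abstract (start from complete data, delete values, impute them, compare with the known originals) can be illustrated with a small simulation. This is only a sketch under stated assumptions. The treatment means, noise levels, deletion pattern, and the `simulate_rmse` helper are hypothetical. Simple treatment-mean imputation stands in for the distribution-free multiple imputation method used in the study, which is not reproduced here. Only the layout of 12 treatments and three replications comes from the abstract.

```python
import math
import random

N_TREATMENTS = 12  # from the abstract
N_BLOCKS = 3       # three replications, one per block


def simulate_rmse(sd, n_missing, seed=0):
    """Delete n_missing cells, impute them, and return the RMSE of the imputed values."""
    rng = random.Random(seed)
    # Hypothetical treatment means; the real tomato data are not reproduced here.
    effects = [50.0 + 2.0 * t for t in range(N_TREATMENTS)]
    # Complete "database": treatment effect plus Gaussian noise scaled by sd.
    data = [[effects[t] + sd * rng.gauss(0.0, 1.0) for _ in range(N_BLOCKS)]
            for t in range(N_TREATMENTS)]
    # Assumption: remove at most one replicate per treatment,
    # so every treatment keeps observed values to impute from.
    treatments = rng.sample(range(N_TREATMENTS), n_missing)
    missing = [(t, rng.randrange(N_BLOCKS)) for t in treatments]
    squared_errors = []
    for t, b in missing:
        observed = [data[t][j] for j in range(N_BLOCKS) if j != b]
        # Stand-in technique: treatment-mean imputation (not the study's method).
        imputed = sum(observed) / len(observed)
        squared_errors.append((imputed - data[t][b]) ** 2)
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Same seed, so both runs share the noise pattern and the deleted cells;
# only the scale of the variability differs.
rmse_low = simulate_rmse(sd=1.0, n_missing=4)
rmse_high = simulate_rmse(sd=5.0, n_missing=4)
print(f"RMSE, low variability:  {rmse_low:.3f}")
print(f"RMSE, high variability: {rmse_high:.3f}")
```

Because both runs share the same seed, the imputation error in this toy setup scales directly with the noise standard deviation, which mirrors the direction of the reported finding without reproducing its results.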

Details

Title
Data variability in the imputation quality of missing data
Author
Elisandra Lúcia Moro Stochero; Alessandro Dal'Col Lúcio; Luciane Flores Jacobi
First page
e66185
Section
Biometria, Modelagem e Estatística
Publication year
2024
Publication date
2024
Publisher
Editora da Universidade Estadual de Maringá - EDUEM
ISSN
1679-9275
e-ISSN
1807-8621
Source type
Scholarly Journal
Language of publication
Portuguese
ProQuest document ID
3236090404
Copyright
© 2024. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.