Distribution-free multiple imputation in an interaction matrix through singular value decomposition

Detalhes bibliográficos
Autor(a) principal: Bergamo,Genevile Carife
Data de Publicação: 2008
Outros Autores: Dias,Carlos Tadeu dos Santos, Krzanowski,Wojtek Janusz
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Scientia Agrícola (Online)
Texto Completo: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-90162008000400015
Resumo: Some techniques of multivariate statistical analysis can only be conducted on a complete data matrix, but the process of data collection often misses some elements. Imputation is a technique by which the missing elements are replaced by plausible values, so that a valid analysis can be performed on the completed data set. A multiple imputation method is proposed based on a modification to the singular value decomposition (SVD) method for single imputation, developed by Krzanowski. The method was evaluated on a genotype × environment (G × E) interaction matrix obtained from a randomized blocks experiment on Eucalyptus grandis grown in multienvironments. Values of E. grandis heights in the G × E complete interaction matrix were deleted randomly at three different rates (5%, 10%, 30%) and were then imputed by the proposed methodology. The results were assessed by means of a general measure of performance (Tacc), and showed a small bias when compared to the original data. However, bias values were greater than the variability of imputations relative to their mean, indicating a smaller accuracy of the proposed method in relation to its precision. The proposed methodology uses the maximum amount of available information, does not have any restrictions regarding the pattern or mechanism of the missing values, and is free of assumptions on the data distribution or structure.
id USP-18_503fd705e643562b60640e75cecb37f2
oai_identifier_str oai:scielo:S0103-90162008000400015
network_acronym_str USP-18
network_name_str Scientia Agrícola (Online)
repository_id_str
spelling Distribution-free multiple imputation in an interaction matrix through singular value decompositionmissing datanonparametriceigenvalueeigenvectorgenotype-environmentSome techniques of multivariate statistical analysis can only be conducted on a complete data matrix, but the process of data collection often misses some elements. Imputation is a technique by which the missing elements are replaced by plausible values, so that a valid analysis can be performed on the completed data set. A multiple imputation method is proposed based on a modification to the singular value decomposition (SVD) method for single imputation, developed by Krzanowski. The method was evaluated on a genotype × environment (G × E) interaction matrix obtained from a randomized blocks experiment on Eucalyptus grandis grown in multienvironments. Values of E. grandis heights in the G × E complete interaction matrix were deleted randomly at three different rates (5%, 10%, 30%) and were then imputed by the proposed methodology. The results were assessed by means of a general measure of performance (Tacc), and showed a small bias when compared to the original data. However, bias values were greater than the variability of imputations relative to their mean, indicating a smaller accuracy of the proposed method in relation to its precision. The proposed methodology uses the maximum amount of available information, does not have any restrictions regarding the pattern or mechanism of the missing values, and is free of assumptions on the data distribution or structure.Escola Superior de Agricultura "Luiz de Queiroz"2008-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-90162008000400015Scientia Agricola v.65 n.4 2008reponame:Scientia Agrícola (Online)instname:Universidade de São Paulo (USP)instacron:USP10.1590/S0103-90162008000400015info:eu-repo/semantics/openAccessBergamo,Genevile CarifeDias,Carlos Tadeu dos SantosKrzanowski,Wojtek Januszeng2008-07-21T00:00:00Zoai:scielo:S0103-90162008000400015Revistahttp://revistas.usp.br/sa/indexPUBhttps://old.scielo.br/oai/scielo-oai.phpscientia@usp.br||alleoni@usp.br1678-992X0103-9016opendoar:2008-07-21T00:00Scientia Agrícola (Online) - Universidade de São Paulo (USP)false
dc.title.none.fl_str_mv Distribution-free multiple imputation in an interaction matrix through singular value decomposition
title Distribution-free multiple imputation in an interaction matrix through singular value decomposition
spellingShingle Distribution-free multiple imputation in an interaction matrix through singular value decomposition
Bergamo,Genevile Carife
missing data
nonparametric
eigenvalue
eigenvector
genotype-environment
title_short Distribution-free multiple imputation in an interaction matrix through singular value decomposition
title_full Distribution-free multiple imputation in an interaction matrix through singular value decomposition
title_fullStr Distribution-free multiple imputation in an interaction matrix through singular value decomposition
title_full_unstemmed Distribution-free multiple imputation in an interaction matrix through singular value decomposition
title_sort Distribution-free multiple imputation in an interaction matrix through singular value decomposition
author Bergamo,Genevile Carife
author_facet Bergamo,Genevile Carife
Dias,Carlos Tadeu dos Santos
Krzanowski,Wojtek Janusz
author_role author
author2 Dias,Carlos Tadeu dos Santos
Krzanowski,Wojtek Janusz
author2_role author
author
dc.contributor.author.fl_str_mv Bergamo,Genevile Carife
Dias,Carlos Tadeu dos Santos
Krzanowski,Wojtek Janusz
dc.subject.por.fl_str_mv missing data
nonparametric
eigenvalue
eigenvector
genotype-environment
topic missing data
nonparametric
eigenvalue
eigenvector
genotype-environment
description Some techniques of multivariate statistical analysis can only be conducted on a complete data matrix, but the process of data collection often misses some elements. Imputation is a technique by which the missing elements are replaced by plausible values, so that a valid analysis can be performed on the completed data set. A multiple imputation method is proposed based on a modification to the singular value decomposition (SVD) method for single imputation, developed by Krzanowski. The method was evaluated on a genotype × environment (G × E) interaction matrix obtained from a randomized blocks experiment on Eucalyptus grandis grown in multienvironments. Values of E. grandis heights in the G × E complete interaction matrix were deleted randomly at three different rates (5%, 10%, 30%) and were then imputed by the proposed methodology. The results were assessed by means of a general measure of performance (Tacc), and showed a small bias when compared to the original data. However, bias values were greater than the variability of imputations relative to their mean, indicating a smaller accuracy of the proposed method in relation to its precision. The proposed methodology uses the maximum amount of available information, does not have any restrictions regarding the pattern or mechanism of the missing values, and is free of assumptions on the data distribution or structure.
publishDate 2008
dc.date.none.fl_str_mv 2008-01-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-90162008000400015
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-90162008000400015
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1590/S0103-90162008000400015
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Escola Superior de Agricultura "Luiz de Queiroz"
publisher.none.fl_str_mv Escola Superior de Agricultura "Luiz de Queiroz"
dc.source.none.fl_str_mv Scientia Agricola v.65 n.4 2008
reponame:Scientia Agrícola (Online)
instname:Universidade de São Paulo (USP)
instacron:USP
instname_str Universidade de São Paulo (USP)
instacron_str USP
institution USP
reponame_str Scientia Agrícola (Online)
collection Scientia Agrícola (Online)
repository.name.fl_str_mv Scientia Agrícola (Online) - Universidade de São Paulo (USP)
repository.mail.fl_str_mv scientia@usp.br||alleoni@usp.br
_version_ 1748936461009289216