Data transformation: an underestimated tool by inappropriate use

Detalhes bibliográficos
Autor(a) principal: Ribeiro-Oliveira, João Paulo
Data de Publicação: 2018
Outros Autores: Santana, Denise Garcia de, Pereira, Vanderley José, Santos, Carlos Machado dos
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Acta Scientiarum. Agronomy (Online)
Texto Completo: http://www.periodicos.uem.br/ojs/index.php/ActaSciAgron/article/view/35300
Resumo: There are researchers who do not recommend data transformation arguing it causes problems in inferences and mischaracterises data sets, which can hinder interpretation. There are other researchers who consider data transformation necessary to meet the assumptions of parametric models. Perhaps the largest group of researchers who make use of data transformation are concerned with experimental accuracy, which provokes the misuse of this tool. Considering this, our paper offer a study about the most frequent situations related to data transformation and how this tool can impact ANOVA assumptions and experimental accuracy. Our database was obtained from measurements of seed physiology and seed technology. The coefficient of variation cannot be used as an indicator of data transformation. Data transformation might violate the assumptions of analysis of variance, invalidating the idea that its use will provoke fail the inferences, even if it does not improve the quality of the analysis. The decision about when to use data transformation is dichotomous, but the criteria for this decision are many. The unit (percentage, day or seedlings per day), the experimental design and the possible robustness of F-statistics to ‘small deviations’ to Normal are among the main indicators for the choice of the type of transformation. 
id UEM-5_b4eab4c257292895cf898d244b32413f
oai_identifier_str oai:periodicos.uem.br/ojs:article/35300
network_acronym_str UEM-5
network_name_str Acta Scientiarum. Agronomy (Online)
repository_id_str
spelling Data transformation: an underestimated tool by inappropriate useassumptionscoefficient of variationcriteria for data transformationparametric and nonparametric statisticsrobustness.Análise de DadosThere are researchers who do not recommend data transformation arguing it causes problems in inferences and mischaracterises data sets, which can hinder interpretation. There are other researchers who consider data transformation necessary to meet the assumptions of parametric models. Perhaps the largest group of researchers who make use of data transformation are concerned with experimental accuracy, which provokes the misuse of this tool. Considering this, our paper offer a study about the most frequent situations related to data transformation and how this tool can impact ANOVA assumptions and experimental accuracy. Our database was obtained from measurements of seed physiology and seed technology. The coefficient of variation cannot be used as an indicator of data transformation. Data transformation might violate the assumptions of analysis of variance, invalidating the idea that its use will provoke fail the inferences, even if it does not improve the quality of the analysis. The decision about when to use data transformation is dichotomous, but the criteria for this decision are many. The unit (percentage, day or seedlings per day), the experimental design and the possible robustness of F-statistics to ‘small deviations’ to Normal are among the main indicators for the choice of the type of transformation. Universidade Estadual de Maringá2018-03-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionData set analysisapplication/pdfapplication/vnd.openxmlformats-officedocument.wordprocessingml.documenthttp://www.periodicos.uem.br/ojs/index.php/ActaSciAgron/article/view/3530010.4025/actasciagron.v40i1.35300Acta Scientiarum. Agronomy; Vol 40 (2018): Publicação Contínua; e35300Acta Scientiarum. Agronomy; v. 40 (2018): Publicação Contínua; e353001807-86211679-9275reponame:Acta Scientiarum. Agronomy (Online)instname:Universidade Estadual de Maringá (UEM)instacron:UEMenghttp://www.periodicos.uem.br/ojs/index.php/ActaSciAgron/article/view/35300/pdfhttp://www.periodicos.uem.br/ojs/index.php/ActaSciAgron/article/view/35300/751375145576Copyright (c) 2018 Acta Scientiarum. Agronomyinfo:eu-repo/semantics/openAccessRibeiro-Oliveira, João PauloSantana, Denise Garcia dePereira, Vanderley JoséSantos, Carlos Machado dos2019-09-24T12:26:47Zoai:periodicos.uem.br/ojs:article/35300Revistahttp://www.periodicos.uem.br/ojs/index.php/ActaSciAgronPUBhttp://www.periodicos.uem.br/ojs/index.php/ActaSciAgron/oaiactaagron@uem.br||actaagron@uem.br|| edamasio@uem.br1807-86211679-9275opendoar:2019-09-24T12:26:47Acta Scientiarum. Agronomy (Online) - Universidade Estadual de Maringá (UEM)false
dc.title.none.fl_str_mv Data transformation: an underestimated tool by inappropriate use
title Data transformation: an underestimated tool by inappropriate use
spellingShingle Data transformation: an underestimated tool by inappropriate use
Ribeiro-Oliveira, João Paulo
assumptions
coefficient of variation
criteria for data transformation
parametric and nonparametric statistics
robustness.
Análise de Dados
title_short Data transformation: an underestimated tool by inappropriate use
title_full Data transformation: an underestimated tool by inappropriate use
title_fullStr Data transformation: an underestimated tool by inappropriate use
title_full_unstemmed Data transformation: an underestimated tool by inappropriate use
title_sort Data transformation: an underestimated tool by inappropriate use
author Ribeiro-Oliveira, João Paulo
author_facet Ribeiro-Oliveira, João Paulo
Santana, Denise Garcia de
Pereira, Vanderley José
Santos, Carlos Machado dos
author_role author
author2 Santana, Denise Garcia de
Pereira, Vanderley José
Santos, Carlos Machado dos
author2_role author
author
author
dc.contributor.author.fl_str_mv Ribeiro-Oliveira, João Paulo
Santana, Denise Garcia de
Pereira, Vanderley José
Santos, Carlos Machado dos
dc.subject.por.fl_str_mv assumptions
coefficient of variation
criteria for data transformation
parametric and nonparametric statistics
robustness.
Análise de Dados
topic assumptions
coefficient of variation
criteria for data transformation
parametric and nonparametric statistics
robustness.
Análise de Dados
description There are researchers who do not recommend data transformation arguing it causes problems in inferences and mischaracterises data sets, which can hinder interpretation. There are other researchers who consider data transformation necessary to meet the assumptions of parametric models. Perhaps the largest group of researchers who make use of data transformation are concerned with experimental accuracy, which provokes the misuse of this tool. Considering this, our paper offer a study about the most frequent situations related to data transformation and how this tool can impact ANOVA assumptions and experimental accuracy. Our database was obtained from measurements of seed physiology and seed technology. The coefficient of variation cannot be used as an indicator of data transformation. Data transformation might violate the assumptions of analysis of variance, invalidating the idea that its use will provoke fail the inferences, even if it does not improve the quality of the analysis. The decision about when to use data transformation is dichotomous, but the criteria for this decision are many. The unit (percentage, day or seedlings per day), the experimental design and the possible robustness of F-statistics to ‘small deviations’ to Normal are among the main indicators for the choice of the type of transformation. 
publishDate 2018
dc.date.none.fl_str_mv 2018-03-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Data set analysis
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://www.periodicos.uem.br/ojs/index.php/ActaSciAgron/article/view/35300
10.4025/actasciagron.v40i1.35300
url http://www.periodicos.uem.br/ojs/index.php/ActaSciAgron/article/view/35300
identifier_str_mv 10.4025/actasciagron.v40i1.35300
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv http://www.periodicos.uem.br/ojs/index.php/ActaSciAgron/article/view/35300/pdf
http://www.periodicos.uem.br/ojs/index.php/ActaSciAgron/article/view/35300/751375145576
dc.rights.driver.fl_str_mv Copyright (c) 2018 Acta Scientiarum. Agronomy
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2018 Acta Scientiarum. Agronomy
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
application/vnd.openxmlformats-officedocument.wordprocessingml.document
dc.publisher.none.fl_str_mv Universidade Estadual de Maringá
publisher.none.fl_str_mv Universidade Estadual de Maringá
dc.source.none.fl_str_mv Acta Scientiarum. Agronomy; Vol 40 (2018): Publicação Contínua; e35300
Acta Scientiarum. Agronomy; v. 40 (2018): Publicação Contínua; e35300
1807-8621
1679-9275
reponame:Acta Scientiarum. Agronomy (Online)
instname:Universidade Estadual de Maringá (UEM)
instacron:UEM
instname_str Universidade Estadual de Maringá (UEM)
instacron_str UEM
institution UEM
reponame_str Acta Scientiarum. Agronomy (Online)
collection Acta Scientiarum. Agronomy (Online)
repository.name.fl_str_mv Acta Scientiarum. Agronomy (Online) - Universidade Estadual de Maringá (UEM)
repository.mail.fl_str_mv actaagron@uem.br||actaagron@uem.br|| edamasio@uem.br
_version_ 1799305910007365632