Re-scaling of model evaluation measures to allow direct comparison of their values

Detalhes bibliográficos
Autor(a) principal: Barbosa, A. Márcia
Data de Publicação: 2015
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10174/22796
https://doi.org/10.5281/zenodo.15487
Resumo: Species distribution models are increasingly used in ecology, biogeography and climate change research, and are usually complemented with one or more metrics evaluating their performance. Not all metrics vary within the same scale of measurement: for example, Cohen’s kappa and the true skill statistic (TSS) may range between -1 and 1, while most other widely used metrics range only between 0 and 1. Values of different measures are thus not directly comparable, and e.g. a kappa or TSS value of 0.6 does not denote (although it may at first sight suggest) lower discriminative accuracy than an area under the curve (AUC) of 0.8. Yet, these measures are often presented side by side without a clear acknowledgement of this scale difference. I propose clearly acknowledging such difference, or else using a simple formula to standardize these measures so that their values can be compared more directly. The following equation converts an evaluation score that ranges from -1 to 1 into its corresponding value in the 0-to-1 scale: (score+1)/2. Conversion can also be done the other way around with 2(score-0.5). This standardization is implemented in the modEvA* package for R (currently available on R-Forge), both as an independent function and as an option within other functions that compute and compare model evaluation measures.
id RCAP_379bc3735a6a57f85af8f6b9bcbe26bb
oai_identifier_str oai:dspace.uevora.pt:10174/22796
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Re-scaling of model evaluation measures to allow direct comparison of their valuesmodel evaluationmodel accuracySpecies distribution models are increasingly used in ecology, biogeography and climate change research, and are usually complemented with one or more metrics evaluating their performance. Not all metrics vary within the same scale of measurement: for example, Cohen’s kappa and the true skill statistic (TSS) may range between -1 and 1, while most other widely used metrics range only between 0 and 1. Values of different measures are thus not directly comparable, and e.g. a kappa or TSS value of 0.6 does not denote (although it may at first sight suggest) lower discriminative accuracy than an area under the curve (AUC) of 0.8. Yet, these measures are often presented side by side without a clear acknowledgement of this scale difference. I propose clearly acknowledging such difference, or else using a simple formula to standardize these measures so that their values can be compared more directly. The following equation converts an evaluation score that ranges from -1 to 1 into its corresponding value in the 0-to-1 scale: (score+1)/2. Conversion can also be done the other way around with 2(score-0.5). This standardization is implemented in the modEvA* package for R (currently available on R-Forge), both as an independent function and as an option within other functions that compute and compare model evaluation measures.2018-03-02T17:20:26Z2018-03-022015-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/22796http://hdl.handle.net/10174/22796https://doi.org/10.5281/zenodo.15487engBarbosa A.M. (2015) Re-scaling of model evaluation measures to allow direct comparison of their values. Journal of Brief Ideas, 10.5281/zenodo.15487http://beta.briefideas.org/ideas/3f1bf29b47a5a2e80894a925846471f5barbosa@uevora.pt221Barbosa, A. Márciainfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-03T19:13:45Zoai:dspace.uevora.pt:10174/22796Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:13:29.453045Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Re-scaling of model evaluation measures to allow direct comparison of their values
title Re-scaling of model evaluation measures to allow direct comparison of their values
spellingShingle Re-scaling of model evaluation measures to allow direct comparison of their values
Barbosa, A. Márcia
model evaluation
model accuracy
title_short Re-scaling of model evaluation measures to allow direct comparison of their values
title_full Re-scaling of model evaluation measures to allow direct comparison of their values
title_fullStr Re-scaling of model evaluation measures to allow direct comparison of their values
title_full_unstemmed Re-scaling of model evaluation measures to allow direct comparison of their values
title_sort Re-scaling of model evaluation measures to allow direct comparison of their values
author Barbosa, A. Márcia
author_facet Barbosa, A. Márcia
author_role author
dc.contributor.author.fl_str_mv Barbosa, A. Márcia
dc.subject.por.fl_str_mv model evaluation
model accuracy
topic model evaluation
model accuracy
description Species distribution models are increasingly used in ecology, biogeography and climate change research, and are usually complemented with one or more metrics evaluating their performance. Not all metrics vary within the same scale of measurement: for example, Cohen’s kappa and the true skill statistic (TSS) may range between -1 and 1, while most other widely used metrics range only between 0 and 1. Values of different measures are thus not directly comparable, and e.g. a kappa or TSS value of 0.6 does not denote (although it may at first sight suggest) lower discriminative accuracy than an area under the curve (AUC) of 0.8. Yet, these measures are often presented side by side without a clear acknowledgement of this scale difference. I propose clearly acknowledging such difference, or else using a simple formula to standardize these measures so that their values can be compared more directly. The following equation converts an evaluation score that ranges from -1 to 1 into its corresponding value in the 0-to-1 scale: (score+1)/2. Conversion can also be done the other way around with 2(score-0.5). This standardization is implemented in the modEvA* package for R (currently available on R-Forge), both as an independent function and as an option within other functions that compute and compare model evaluation measures.
publishDate 2015
dc.date.none.fl_str_mv 2015-01-01T00:00:00Z
2018-03-02T17:20:26Z
2018-03-02
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10174/22796
http://hdl.handle.net/10174/22796
https://doi.org/10.5281/zenodo.15487
url http://hdl.handle.net/10174/22796
https://doi.org/10.5281/zenodo.15487
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Barbosa A.M. (2015) Re-scaling of model evaluation measures to allow direct comparison of their values. Journal of Brief Ideas, 10.5281/zenodo.15487
http://beta.briefideas.org/ideas/3f1bf29b47a5a2e80894a925846471f5
barbosa@uevora.pt
221
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799136616760999936