A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps

Detalhes bibliográficos
Autor(a) principal: Amini Tehrani, Nasrin
Data de Publicação: 2022
Outros Autores: Naimi, Babak, Jaboyedoff, Michel
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10174/35711
https://doi.org/10.1016/j.ecoinf.2021.101501
Resumo: It is essential to accurately model species distributions and biodiversity in response to many ecological and conservation challenges. The primary means of reliable decision-making on conservation priority are the data on the distributions and abundance of species. However, finding data that is accurate and reliable for predicting species distribution could be challenging. Data could come from different sources, with different designs, coverage, and potential sampling biases. In this study, we examined the emerging methods of modelling species distribution that integrate data from multiple sources such as systematic or standardized and casual or occasional surveys. We applied two modelling approaches, “data-pooling” and “ model-based data integration” that each involves combining various datasets to measure environmental interactions and clarify the distribution of species. Our paper demonstrates a reliable data integration workflow that includes gathering information on model-based data integration, creating a sub-model of each dataset independently, and finally, combining it into a single final model. We have shown that this is a more reliable way of developing a model than a data pooling strategy that combines multiple data sources to fit a single model. Moreover, data integration approaches could improve the poor predictive performance of systematic small datasets, through model-based data integration techniques that enhance the predictive accuracy of Species Distribution Models. We also identified, consistent with previous research, that machine learning algorithms are the most accurate techniques to predict bird species distribution in our heterogeneous study area in the western Swiss Alps. In particular, tree-dependent ensembles of Random Forest (RF) contribute to a better understanding of the interactions between species and the environment.
id RCAP_88c72fd77cebf5f72a20c4fbb46eb5cc
oai_identifier_str oai:dspace.uevora.pt:10174/35711
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss AlpsIt is essential to accurately model species distributions and biodiversity in response to many ecological and conservation challenges. The primary means of reliable decision-making on conservation priority are the data on the distributions and abundance of species. However, finding data that is accurate and reliable for predicting species distribution could be challenging. Data could come from different sources, with different designs, coverage, and potential sampling biases. In this study, we examined the emerging methods of modelling species distribution that integrate data from multiple sources such as systematic or standardized and casual or occasional surveys. We applied two modelling approaches, “data-pooling” and “ model-based data integration” that each involves combining various datasets to measure environmental interactions and clarify the distribution of species. Our paper demonstrates a reliable data integration workflow that includes gathering information on model-based data integration, creating a sub-model of each dataset independently, and finally, combining it into a single final model. We have shown that this is a more reliable way of developing a model than a data pooling strategy that combines multiple data sources to fit a single model. Moreover, data integration approaches could improve the poor predictive performance of systematic small datasets, through model-based data integration techniques that enhance the predictive accuracy of Species Distribution Models. We also identified, consistent with previous research, that machine learning algorithms are the most accurate techniques to predict bird species distribution in our heterogeneous study area in the western Swiss Alps. In particular, tree-dependent ensembles of Random Forest (RF) contribute to a better understanding of the interactions between species and the environment.2023-11-22T11:10:33Z2023-11-222022-07-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/35711http://hdl.handle.net/10174/35711https://doi.org/10.1016/j.ecoinf.2021.101501engTehrani, N. A., Naimi, B., & Jaboyedoff, M. (2022). A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps. Ecological Informatics, 69, 101501.ndbabak.naimi@uevora.ptndAmini Tehrani, NasrinNaimi, BabakJaboyedoff, Michelinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-03T19:39:32Zoai:dspace.uevora.pt:10174/35711Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:24:02.737752Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps
title A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps
spellingShingle A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps
Amini Tehrani, Nasrin
title_short A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps
title_full A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps
title_fullStr A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps
title_full_unstemmed A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps
title_sort A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps
author Amini Tehrani, Nasrin
author_facet Amini Tehrani, Nasrin
Naimi, Babak
Jaboyedoff, Michel
author_role author
author2 Naimi, Babak
Jaboyedoff, Michel
author2_role author
author
dc.contributor.author.fl_str_mv Amini Tehrani, Nasrin
Naimi, Babak
Jaboyedoff, Michel
description It is essential to accurately model species distributions and biodiversity in response to many ecological and conservation challenges. The primary means of reliable decision-making on conservation priority are the data on the distributions and abundance of species. However, finding data that is accurate and reliable for predicting species distribution could be challenging. Data could come from different sources, with different designs, coverage, and potential sampling biases. In this study, we examined the emerging methods of modelling species distribution that integrate data from multiple sources such as systematic or standardized and casual or occasional surveys. We applied two modelling approaches, “data-pooling” and “ model-based data integration” that each involves combining various datasets to measure environmental interactions and clarify the distribution of species. Our paper demonstrates a reliable data integration workflow that includes gathering information on model-based data integration, creating a sub-model of each dataset independently, and finally, combining it into a single final model. We have shown that this is a more reliable way of developing a model than a data pooling strategy that combines multiple data sources to fit a single model. Moreover, data integration approaches could improve the poor predictive performance of systematic small datasets, through model-based data integration techniques that enhance the predictive accuracy of Species Distribution Models. We also identified, consistent with previous research, that machine learning algorithms are the most accurate techniques to predict bird species distribution in our heterogeneous study area in the western Swiss Alps. In particular, tree-dependent ensembles of Random Forest (RF) contribute to a better understanding of the interactions between species and the environment.
publishDate 2022
dc.date.none.fl_str_mv 2022-07-01T00:00:00Z
2023-11-22T11:10:33Z
2023-11-22
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10174/35711
http://hdl.handle.net/10174/35711
https://doi.org/10.1016/j.ecoinf.2021.101501
url http://hdl.handle.net/10174/35711
https://doi.org/10.1016/j.ecoinf.2021.101501
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Tehrani, N. A., Naimi, B., & Jaboyedoff, M. (2022). A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps. Ecological Informatics, 69, 101501.
nd
babak.naimi@uevora.pt
nd
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799136723174686720