A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps
Autor(a) principal: | |
---|---|
Data de Publicação: | 2022 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10174/35711 https://doi.org/10.1016/j.ecoinf.2021.101501 |
Resumo: | It is essential to accurately model species distributions and biodiversity in response to many ecological and conservation challenges. The primary means of reliable decision-making on conservation priority are the data on the distributions and abundance of species. However, finding data that is accurate and reliable for predicting species distribution could be challenging. Data could come from different sources, with different designs, coverage, and potential sampling biases. In this study, we examined the emerging methods of modelling species distribution that integrate data from multiple sources such as systematic or standardized and casual or occasional surveys. We applied two modelling approaches, “data-pooling” and “ model-based data integration” that each involves combining various datasets to measure environmental interactions and clarify the distribution of species. Our paper demonstrates a reliable data integration workflow that includes gathering information on model-based data integration, creating a sub-model of each dataset independently, and finally, combining it into a single final model. We have shown that this is a more reliable way of developing a model than a data pooling strategy that combines multiple data sources to fit a single model. Moreover, data integration approaches could improve the poor predictive performance of systematic small datasets, through model-based data integration techniques that enhance the predictive accuracy of Species Distribution Models. We also identified, consistent with previous research, that machine learning algorithms are the most accurate techniques to predict bird species distribution in our heterogeneous study area in the western Swiss Alps. In particular, tree-dependent ensembles of Random Forest (RF) contribute to a better understanding of the interactions between species and the environment. |
id |
RCAP_88c72fd77cebf5f72a20c4fbb46eb5cc |
---|---|
oai_identifier_str |
oai:dspace.uevora.pt:10174/35711 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss AlpsIt is essential to accurately model species distributions and biodiversity in response to many ecological and conservation challenges. The primary means of reliable decision-making on conservation priority are the data on the distributions and abundance of species. However, finding data that is accurate and reliable for predicting species distribution could be challenging. Data could come from different sources, with different designs, coverage, and potential sampling biases. In this study, we examined the emerging methods of modelling species distribution that integrate data from multiple sources such as systematic or standardized and casual or occasional surveys. We applied two modelling approaches, “data-pooling” and “ model-based data integration” that each involves combining various datasets to measure environmental interactions and clarify the distribution of species. Our paper demonstrates a reliable data integration workflow that includes gathering information on model-based data integration, creating a sub-model of each dataset independently, and finally, combining it into a single final model. We have shown that this is a more reliable way of developing a model than a data pooling strategy that combines multiple data sources to fit a single model. Moreover, data integration approaches could improve the poor predictive performance of systematic small datasets, through model-based data integration techniques that enhance the predictive accuracy of Species Distribution Models. We also identified, consistent with previous research, that machine learning algorithms are the most accurate techniques to predict bird species distribution in our heterogeneous study area in the western Swiss Alps. In particular, tree-dependent ensembles of Random Forest (RF) contribute to a better understanding of the interactions between species and the environment.2023-11-22T11:10:33Z2023-11-222022-07-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/35711http://hdl.handle.net/10174/35711https://doi.org/10.1016/j.ecoinf.2021.101501engTehrani, N. A., Naimi, B., & Jaboyedoff, M. (2022). A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps. Ecological Informatics, 69, 101501.ndbabak.naimi@uevora.ptndAmini Tehrani, NasrinNaimi, BabakJaboyedoff, Michelinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-03T19:39:32Zoai:dspace.uevora.pt:10174/35711Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:24:02.737752Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps |
title |
A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps |
spellingShingle |
A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps Amini Tehrani, Nasrin |
title_short |
A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps |
title_full |
A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps |
title_fullStr |
A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps |
title_full_unstemmed |
A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps |
title_sort |
A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps |
author |
Amini Tehrani, Nasrin |
author_facet |
Amini Tehrani, Nasrin Naimi, Babak Jaboyedoff, Michel |
author_role |
author |
author2 |
Naimi, Babak Jaboyedoff, Michel |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Amini Tehrani, Nasrin Naimi, Babak Jaboyedoff, Michel |
description |
It is essential to accurately model species distributions and biodiversity in response to many ecological and conservation challenges. The primary means of reliable decision-making on conservation priority are the data on the distributions and abundance of species. However, finding data that is accurate and reliable for predicting species distribution could be challenging. Data could come from different sources, with different designs, coverage, and potential sampling biases. In this study, we examined the emerging methods of modelling species distribution that integrate data from multiple sources such as systematic or standardized and casual or occasional surveys. We applied two modelling approaches, “data-pooling” and “ model-based data integration” that each involves combining various datasets to measure environmental interactions and clarify the distribution of species. Our paper demonstrates a reliable data integration workflow that includes gathering information on model-based data integration, creating a sub-model of each dataset independently, and finally, combining it into a single final model. We have shown that this is a more reliable way of developing a model than a data pooling strategy that combines multiple data sources to fit a single model. Moreover, data integration approaches could improve the poor predictive performance of systematic small datasets, through model-based data integration techniques that enhance the predictive accuracy of Species Distribution Models. We also identified, consistent with previous research, that machine learning algorithms are the most accurate techniques to predict bird species distribution in our heterogeneous study area in the western Swiss Alps. In particular, tree-dependent ensembles of Random Forest (RF) contribute to a better understanding of the interactions between species and the environment. |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022-07-01T00:00:00Z 2023-11-22T11:10:33Z 2023-11-22 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10174/35711 http://hdl.handle.net/10174/35711 https://doi.org/10.1016/j.ecoinf.2021.101501 |
url |
http://hdl.handle.net/10174/35711 https://doi.org/10.1016/j.ecoinf.2021.101501 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Tehrani, N. A., Naimi, B., & Jaboyedoff, M. (2022). A data-integration approach to correct sampling bias in species distribution models using multiple datasets of breeding birds in the Swiss Alps. Ecological Informatics, 69, 101501. nd babak.naimi@uevora.pt nd |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136723174686720 |