Satellite-based estimation of soil organic carbon in Portuguese grasslands

Detalhes bibliográficos
Autor(a) principal: Morais, Tiago G.
Data de Publicação: 2023
Outros Autores: Jongen, Marjan, Tufik, Camila, Rodrigues, Nuno R., Gama, Ivo, Serrano, João, Gonçalves, Maria C., Mano, Raquel, Domingos, Tiago, Teixeira, Ricardo F. M.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.5/29346
Resumo: Soil organic carbon (SOC) sequestration is one of the main ecosystem services provided by well-managed grasslands. In the Mediterranean region, sown biodiverse pastures (SBP) rich in legumes are a nature-based, innovative, and economically competitive livestock production system. As a co-benefit of increased yield, they also contribute to carbon sequestration through SOC accumulation. However, SOC monitoring in SBP require time-consuming and costly field work. Methods: In this study, we propose an expedited and cost-effective indirect method to estimate SOC content. In this study, we developed models for estimating SOC concentration by combining remote sensing (RS) and machine learning (ML) approaches. We used field-measured data collected from nine different farms during four production years (between 2017 and 2021). We utilized RS data from both Sentinel-1 and Sentinel-2, including reflectance bands and vegetation indices. We also used other covariates such as climatic, soil, and terrain variables, for a total of 49 inputs. To reduce multicollinearity problems between the different variables, we performed feature selection using the sequential feature selection approach. We then estimated SOC content using both the complete dataset and the selected features. Multiple ML methods were tested and compared, including multiple linear regression (MLR), random forests (RF), extreme gradient boosting (XGB), and artificial neural networks (ANN). We used a random cross-validation approach (with 10 folds). To find the hyperparameters that led to the best performance, we used a Bayesian optimization approach. Results: Results showed that the XGB method led to higher estimation accuracy than the other methods, and the estimation performance was not significantly influenced by the feature selection approach. For XGB, the average root mean square error (RMSE), measured on the test set among all folds, was 2.78 g kg−1 (r2 equal to 0.68) without feature selection, and 2.77 g kg−1 (r2 equal to 0.68) with feature selection (average SOC content is 13 g kg−1). The models were applied to obtain SOC content maps for all farms. Discussion: This work demonstrated that combining RS and ML can help obtain quick estimations of SOC content to assist with SBP management
id RCAP_7717181a94ebf2b0028b8895592d54db
oai_identifier_str oai:www.repository.utl.pt:10400.5/29346
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Satellite-based estimation of soil organic carbon in Portuguese grasslandsremote sensingsatellitecross-validationfeatures selectionsown biodiverse pastureSoil organic carbon (SOC) sequestration is one of the main ecosystem services provided by well-managed grasslands. In the Mediterranean region, sown biodiverse pastures (SBP) rich in legumes are a nature-based, innovative, and economically competitive livestock production system. As a co-benefit of increased yield, they also contribute to carbon sequestration through SOC accumulation. However, SOC monitoring in SBP require time-consuming and costly field work. Methods: In this study, we propose an expedited and cost-effective indirect method to estimate SOC content. In this study, we developed models for estimating SOC concentration by combining remote sensing (RS) and machine learning (ML) approaches. We used field-measured data collected from nine different farms during four production years (between 2017 and 2021). We utilized RS data from both Sentinel-1 and Sentinel-2, including reflectance bands and vegetation indices. We also used other covariates such as climatic, soil, and terrain variables, for a total of 49 inputs. To reduce multicollinearity problems between the different variables, we performed feature selection using the sequential feature selection approach. We then estimated SOC content using both the complete dataset and the selected features. Multiple ML methods were tested and compared, including multiple linear regression (MLR), random forests (RF), extreme gradient boosting (XGB), and artificial neural networks (ANN). We used a random cross-validation approach (with 10 folds). To find the hyperparameters that led to the best performance, we used a Bayesian optimization approach. Results: Results showed that the XGB method led to higher estimation accuracy than the other methods, and the estimation performance was not significantly influenced by the feature selection approach. For XGB, the average root mean square error (RMSE), measured on the test set among all folds, was 2.78 g kg−1 (r2 equal to 0.68) without feature selection, and 2.77 g kg−1 (r2 equal to 0.68) with feature selection (average SOC content is 13 g kg−1). The models were applied to obtain SOC content maps for all farms. Discussion: This work demonstrated that combining RS and ML can help obtain quick estimations of SOC content to assist with SBP managementFrontiersRepositório da Universidade de LisboaMorais, Tiago G.Jongen, MarjanTufik, CamilaRodrigues, Nuno R.Gama, IvoSerrano, JoãoGonçalves, Maria C.Mano, RaquelDomingos, TiagoTeixeira, Ricardo F. M.2023-11-09T12:16:08Z20232023-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.5/29346engMorais TG, Jongen M, Tufik C, Rodrigues NR, Gama I, Serrano J, Gonçalves MC, Mano R, Domingos T and Teixeira RFM (2023), Satellite-based estimation of soil organic carbon in Portuguese grasslands. Front. Environ. Sci. 11:124010610.3389/fenvs.2023.1240106info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-12T01:31:51Zoai:www.repository.utl.pt:10400.5/29346Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T22:38:00.647432Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Satellite-based estimation of soil organic carbon in Portuguese grasslands
title Satellite-based estimation of soil organic carbon in Portuguese grasslands
spellingShingle Satellite-based estimation of soil organic carbon in Portuguese grasslands
Morais, Tiago G.
remote sensing
satellite
cross-validation
features selection
sown biodiverse pasture
title_short Satellite-based estimation of soil organic carbon in Portuguese grasslands
title_full Satellite-based estimation of soil organic carbon in Portuguese grasslands
title_fullStr Satellite-based estimation of soil organic carbon in Portuguese grasslands
title_full_unstemmed Satellite-based estimation of soil organic carbon in Portuguese grasslands
title_sort Satellite-based estimation of soil organic carbon in Portuguese grasslands
author Morais, Tiago G.
author_facet Morais, Tiago G.
Jongen, Marjan
Tufik, Camila
Rodrigues, Nuno R.
Gama, Ivo
Serrano, João
Gonçalves, Maria C.
Mano, Raquel
Domingos, Tiago
Teixeira, Ricardo F. M.
author_role author
author2 Jongen, Marjan
Tufik, Camila
Rodrigues, Nuno R.
Gama, Ivo
Serrano, João
Gonçalves, Maria C.
Mano, Raquel
Domingos, Tiago
Teixeira, Ricardo F. M.
author2_role author
author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv Repositório da Universidade de Lisboa
dc.contributor.author.fl_str_mv Morais, Tiago G.
Jongen, Marjan
Tufik, Camila
Rodrigues, Nuno R.
Gama, Ivo
Serrano, João
Gonçalves, Maria C.
Mano, Raquel
Domingos, Tiago
Teixeira, Ricardo F. M.
dc.subject.por.fl_str_mv remote sensing
satellite
cross-validation
features selection
sown biodiverse pasture
topic remote sensing
satellite
cross-validation
features selection
sown biodiverse pasture
description Soil organic carbon (SOC) sequestration is one of the main ecosystem services provided by well-managed grasslands. In the Mediterranean region, sown biodiverse pastures (SBP) rich in legumes are a nature-based, innovative, and economically competitive livestock production system. As a co-benefit of increased yield, they also contribute to carbon sequestration through SOC accumulation. However, SOC monitoring in SBP require time-consuming and costly field work. Methods: In this study, we propose an expedited and cost-effective indirect method to estimate SOC content. In this study, we developed models for estimating SOC concentration by combining remote sensing (RS) and machine learning (ML) approaches. We used field-measured data collected from nine different farms during four production years (between 2017 and 2021). We utilized RS data from both Sentinel-1 and Sentinel-2, including reflectance bands and vegetation indices. We also used other covariates such as climatic, soil, and terrain variables, for a total of 49 inputs. To reduce multicollinearity problems between the different variables, we performed feature selection using the sequential feature selection approach. We then estimated SOC content using both the complete dataset and the selected features. Multiple ML methods were tested and compared, including multiple linear regression (MLR), random forests (RF), extreme gradient boosting (XGB), and artificial neural networks (ANN). We used a random cross-validation approach (with 10 folds). To find the hyperparameters that led to the best performance, we used a Bayesian optimization approach. Results: Results showed that the XGB method led to higher estimation accuracy than the other methods, and the estimation performance was not significantly influenced by the feature selection approach. For XGB, the average root mean square error (RMSE), measured on the test set among all folds, was 2.78 g kg−1 (r2 equal to 0.68) without feature selection, and 2.77 g kg−1 (r2 equal to 0.68) with feature selection (average SOC content is 13 g kg−1). The models were applied to obtain SOC content maps for all farms. Discussion: This work demonstrated that combining RS and ML can help obtain quick estimations of SOC content to assist with SBP management
publishDate 2023
dc.date.none.fl_str_mv 2023-11-09T12:16:08Z
2023
2023-01-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.5/29346
url http://hdl.handle.net/10400.5/29346
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Morais TG, Jongen M, Tufik C, Rodrigues NR, Gama I, Serrano J, Gonçalves MC, Mano R, Domingos T and Teixeira RFM (2023), Satellite-based estimation of soil organic carbon in Portuguese grasslands. Front. Environ. Sci. 11:1240106
10.3389/fenvs.2023.1240106
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Frontiers
publisher.none.fl_str_mv Frontiers
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799134938856947712