Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.

Detalhes bibliográficos
Autor(a) principal: Parreiras, Taya Cristo
Data de Publicação: 2023
Outros Autores: Bolfe, Édson Luis, Silva, Lucas Augusto Pereira da, Sano, Edson Eyji, Bettiol, Giovana Maranhão
Tipo de documento: Conjunto de dados
Título da fonte: Repositório de Dados de Pesquisa da EMBRAPA (Redape)
Texto Completo: https://doi.org/10.48432/1YYF9Y
Resumo: This dataset is related to the paper "Mapping Agricultural Intensification in the Brazilian Savanna: A Machine Learning Approach and Harmonized Data from Landsat Sentinel-2". The study aimed to analyze the performance of the machine learning algorithms Random Forest (RF), Artificial Neural Networks (ANN), and Extreme Gradient Boosting (XGBoost), fed with the time-series of spectral indices NDVI, NDWI, and SAVI from NASA Harmonized Landsat Sentinel-2 (HLS), in detecting intensification (number of cycles) and crop types in Sorriso municipality, Mato Grosso State, in the 2021-2022 crop season, using hierarchical classification in three levels. At Level 1, the target classes were temporary crops (1), native vegetation and silviculture (2), and pastures (3). At Level 2, double cropping (1), single cropping (2), and triple cropping (3). At Level 3, the aim was to identify the second-season crops cultivated in areas identified as double cropping: beans (1), corn (2), cotton (3), and other crops (4). The files available in this dataset are: - Vector files, in shapefiles format, with ground samples obtained during fieldwork in Sorriso, Mato Grosso, between 6-9 June 2022. The files are compressed by level, with the names "Samples_LevelX.zip" in the "Vector" folder. - Worksheets for modeling, in xlsx format, containing the values of the time series of each spectral index, at each classification level, for each sampling point. The files are named "DB_index_LevelX.xlsx" (e.g., "DB_NDVI_Level1"). There is also a PDF file (Order_of_Layers.pdf) to identify the explanatory variables according to the layer order of the original raster stack (e.g., "NDVI_1" is NDVI from September 3rd, 2021). These files are in the "Dataset" folder with subfolders named by level (e.g., "Level_1"). - The R scripts for running the models, getting confusion matrices, and accuracy metrics. The files are named "ALGORITHM_LevelX.R" (e.g., "ANN_Level1.R" or "RF_Level2.R"). In each script, all the modeling processes of all spectral indices are present. For example, the file "ANN_Level1.R" contains the models with the variables NDVI, SAVI, NDWI, and the three combined (AllVI). - The results of each model, in 'rds' format (use R to read it). The files are named "ALGORITHM_index_model_LevelX.rds" (e.g., "XGBoost_NDVI_model_Level2.rds") and allocated in the "Results" folder. - The 27 final maps resulted from spatial predictions in TIFF format (e.g., "Map_ANN_NDVI_Level3_Final.tif"). The files are in the "Final_Maps" folder. Each file contains a brief description, and we encourage users to read the associated paper for further processing details.
id EMBRAPA-08_0399d0a14e236bd614ebf45c3c310f62
oai_identifier_str doi:10.48432/1YYF9Y
network_acronym_str EMBRAPA-08
network_name_str Repositório de Dados de Pesquisa da EMBRAPA (Redape)
repository_id_str
spelling https://doi.org/10.48432/1YYF9YParreiras, Taya CristoBolfe, Édson LuisSilva, Lucas Augusto Pereira daSano, Edson EyjiBettiol, Giovana MaranhãoMapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.RedapeThis dataset is related to the paper "Mapping Agricultural Intensification in the Brazilian Savanna: A Machine Learning Approach and Harmonized Data from Landsat Sentinel-2". The study aimed to analyze the performance of the machine learning algorithms Random Forest (RF), Artificial Neural Networks (ANN), and Extreme Gradient Boosting (XGBoost), fed with the time-series of spectral indices NDVI, NDWI, and SAVI from NASA Harmonized Landsat Sentinel-2 (HLS), in detecting intensification (number of cycles) and crop types in Sorriso municipality, Mato Grosso State, in the 2021-2022 crop season, using hierarchical classification in three levels. At Level 1, the target classes were temporary crops (1), native vegetation and silviculture (2), and pastures (3). At Level 2, double cropping (1), single cropping (2), and triple cropping (3). At Level 3, the aim was to identify the second-season crops cultivated in areas identified as double cropping: beans (1), corn (2), cotton (3), and other crops (4). The files available in this dataset are: - Vector files, in shapefiles format, with ground samples obtained during fieldwork in Sorriso, Mato Grosso, between 6-9 June 2022. The files are compressed by level, with the names "Samples_LevelX.zip" in the "Vector" folder. - Worksheets for modeling, in xlsx format, containing the values of the time series of each spectral index, at each classification level, for each sampling point. The files are named "DB_index_LevelX.xlsx" (e.g., "DB_NDVI_Level1"). There is also a PDF file (Order_of_Layers.pdf) to identify the explanatory variables according to the layer order of the original raster stack (e.g., "NDVI_1" is NDVI from September 3rd, 2021). These files are in the "Dataset" folder with subfolders named by level (e.g., "Level_1"). - The R scripts for running the models, getting confusion matrices, and accuracy metrics. The files are named "ALGORITHM_LevelX.R" (e.g., "ANN_Level1.R" or "RF_Level2.R"). In each script, all the modeling processes of all spectral indices are present. For example, the file "ANN_Level1.R" contains the models with the variables NDVI, SAVI, NDWI, and the three combined (AllVI). - The results of each model, in 'rds' format (use R to read it). The files are named "ALGORITHM_index_model_LevelX.rds" (e.g., "XGBoost_NDVI_model_Level2.rds") and allocated in the "Results" folder. - The 27 final maps resulted from spatial predictions in TIFF format (e.g., "Map_ANN_NDVI_Level3_Final.tif"). The files are in the "Final_Maps" folder. Each file contains a brief description, and we encourage users to read the associated paper for further processing details.2023-07-04info:eu-repo/semantics/openAccesshttps://www.redape.dados.embrapa.br/licenses/embrapa-by-nc-4.0.xhtmlEarth and Environmental SciencesOtherMultisensorHLSAgricultureRemote SensingCerradoinfo:eu-repo/semantics/datasetinfo:eu-repo/semantics/datasetinfo:eu-repo/semantics/publishedVersionDatasetreponame:Repositório de Dados de Pesquisa da EMBRAPA (Redape)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPARepositório de Dados de PesquisaPUBhttps://www.redape.dados.embrapa.br/oaiopendoar:2024-03-20T09:48:51.994612Repositório de Dados de Pesquisa da EMBRAPA (Redape) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)falsedoi:10.48432/1YYF9Y
dc.title.none.fl_str_mv Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.
title Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.
spellingShingle Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.
Parreiras, Taya Cristo
Earth and Environmental Sciences
Other
Multisensor
HLS
Agriculture
Remote Sensing
Cerrado
title_short Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.
title_full Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.
title_fullStr Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.
title_full_unstemmed Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.
title_sort Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.
author Parreiras, Taya Cristo
author_facet Parreiras, Taya Cristo
Bolfe, Édson Luis
Silva, Lucas Augusto Pereira da
Sano, Edson Eyji
Bettiol, Giovana Maranhão
author_role author
author2 Bolfe, Édson Luis
Silva, Lucas Augusto Pereira da
Sano, Edson Eyji
Bettiol, Giovana Maranhão
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Parreiras, Taya Cristo
Bolfe, Édson Luis
Silva, Lucas Augusto Pereira da
Sano, Edson Eyji
Bettiol, Giovana Maranhão
dc.subject.none.fl_str_mv Earth and Environmental Sciences
Other
Multisensor
HLS
Agriculture
Remote Sensing
Cerrado
topic Earth and Environmental Sciences
Other
Multisensor
HLS
Agriculture
Remote Sensing
Cerrado
description This dataset is related to the paper "Mapping Agricultural Intensification in the Brazilian Savanna: A Machine Learning Approach and Harmonized Data from Landsat Sentinel-2". The study aimed to analyze the performance of the machine learning algorithms Random Forest (RF), Artificial Neural Networks (ANN), and Extreme Gradient Boosting (XGBoost), fed with the time-series of spectral indices NDVI, NDWI, and SAVI from NASA Harmonized Landsat Sentinel-2 (HLS), in detecting intensification (number of cycles) and crop types in Sorriso municipality, Mato Grosso State, in the 2021-2022 crop season, using hierarchical classification in three levels. At Level 1, the target classes were temporary crops (1), native vegetation and silviculture (2), and pastures (3). At Level 2, double cropping (1), single cropping (2), and triple cropping (3). At Level 3, the aim was to identify the second-season crops cultivated in areas identified as double cropping: beans (1), corn (2), cotton (3), and other crops (4). The files available in this dataset are: - Vector files, in shapefiles format, with ground samples obtained during fieldwork in Sorriso, Mato Grosso, between 6-9 June 2022. The files are compressed by level, with the names "Samples_LevelX.zip" in the "Vector" folder. - Worksheets for modeling, in xlsx format, containing the values of the time series of each spectral index, at each classification level, for each sampling point. The files are named "DB_index_LevelX.xlsx" (e.g., "DB_NDVI_Level1"). There is also a PDF file (Order_of_Layers.pdf) to identify the explanatory variables according to the layer order of the original raster stack (e.g., "NDVI_1" is NDVI from September 3rd, 2021). These files are in the "Dataset" folder with subfolders named by level (e.g., "Level_1"). - The R scripts for running the models, getting confusion matrices, and accuracy metrics. The files are named "ALGORITHM_LevelX.R" (e.g., "ANN_Level1.R" or "RF_Level2.R"). In each script, all the modeling processes of all spectral indices are present. For example, the file "ANN_Level1.R" contains the models with the variables NDVI, SAVI, NDWI, and the three combined (AllVI). - The results of each model, in 'rds' format (use R to read it). The files are named "ALGORITHM_index_model_LevelX.rds" (e.g., "XGBoost_NDVI_model_Level2.rds") and allocated in the "Results" folder. - The 27 final maps resulted from spatial predictions in TIFF format (e.g., "Map_ANN_NDVI_Level3_Final.tif"). The files are in the "Final_Maps" folder. Each file contains a brief description, and we encourage users to read the associated paper for further processing details.
publishDate 2023
dc.date.issued.fl_str_mv 2023-07-04
dc.type.openaire.fl_str_mv info:eu-repo/semantics/dataset
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.none.fl_str_mv info:eu-repo/semantics/dataset
format dataset
status_str publishedVersion
dc.identifier.url.fl_str_mv https://doi.org/10.48432/1YYF9Y
url https://doi.org/10.48432/1YYF9Y
dc.rights.openaire.fl_str_mv info:eu-repo/semantics/openAccess
https://www.redape.dados.embrapa.br/licenses/embrapa-by-nc-4.0.xhtml
eu_rights_str_mv openAccess
rights_invalid_str_mv https://www.redape.dados.embrapa.br/licenses/embrapa-by-nc-4.0.xhtml
dc.format.none.fl_str_mv Dataset
dc.publisher.none.fl_str_mv Redape
publisher.none.fl_str_mv Redape
dc.source.none.fl_str_mv reponame:Repositório de Dados de Pesquisa da EMBRAPA (Redape)
instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron:EMBRAPA
instname_str Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str EMBRAPA
institution EMBRAPA
reponame_str Repositório de Dados de Pesquisa da EMBRAPA (Redape)
collection Repositório de Dados de Pesquisa da EMBRAPA (Redape)
repository.name.fl_str_mv Repositório de Dados de Pesquisa da EMBRAPA (Redape) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv
_version_ 1808846633171943424