Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.
Autor(a) principal: | |
---|---|
Data de Publicação: | 2023 |
Outros Autores: | , , , |
Tipo de documento: | Conjunto de dados |
Título da fonte: | Repositório de Dados de Pesquisa da EMBRAPA (Redape) |
Texto Completo: | https://doi.org/10.48432/1YYF9Y |
Resumo: | This dataset is related to the paper "Mapping Agricultural Intensification in the Brazilian Savanna: A Machine Learning Approach and Harmonized Data from Landsat Sentinel-2". The study aimed to analyze the performance of the machine learning algorithms Random Forest (RF), Artificial Neural Networks (ANN), and Extreme Gradient Boosting (XGBoost), fed with the time-series of spectral indices NDVI, NDWI, and SAVI from NASA Harmonized Landsat Sentinel-2 (HLS), in detecting intensification (number of cycles) and crop types in Sorriso municipality, Mato Grosso State, in the 2021-2022 crop season, using hierarchical classification in three levels. At Level 1, the target classes were temporary crops (1), native vegetation and silviculture (2), and pastures (3). At Level 2, double cropping (1), single cropping (2), and triple cropping (3). At Level 3, the aim was to identify the second-season crops cultivated in areas identified as double cropping: beans (1), corn (2), cotton (3), and other crops (4). The files available in this dataset are: - Vector files, in shapefiles format, with ground samples obtained during fieldwork in Sorriso, Mato Grosso, between 6-9 June 2022. The files are compressed by level, with the names "Samples_LevelX.zip" in the "Vector" folder. - Worksheets for modeling, in xlsx format, containing the values of the time series of each spectral index, at each classification level, for each sampling point. The files are named "DB_index_LevelX.xlsx" (e.g., "DB_NDVI_Level1"). There is also a PDF file (Order_of_Layers.pdf) to identify the explanatory variables according to the layer order of the original raster stack (e.g., "NDVI_1" is NDVI from September 3rd, 2021). These files are in the "Dataset" folder with subfolders named by level (e.g., "Level_1"). - The R scripts for running the models, getting confusion matrices, and accuracy metrics. The files are named "ALGORITHM_LevelX.R" (e.g., "ANN_Level1.R" or "RF_Level2.R"). In each script, all the modeling processes of all spectral indices are present. For example, the file "ANN_Level1.R" contains the models with the variables NDVI, SAVI, NDWI, and the three combined (AllVI). - The results of each model, in 'rds' format (use R to read it). The files are named "ALGORITHM_index_model_LevelX.rds" (e.g., "XGBoost_NDVI_model_Level2.rds") and allocated in the "Results" folder. - The 27 final maps resulted from spatial predictions in TIFF format (e.g., "Map_ANN_NDVI_Level3_Final.tif"). The files are in the "Final_Maps" folder. Each file contains a brief description, and we encourage users to read the associated paper for further processing details. |
id |
EMBRAPA-08_0399d0a14e236bd614ebf45c3c310f62 |
---|---|
oai_identifier_str |
doi:10.48432/1YYF9Y |
network_acronym_str |
EMBRAPA-08 |
network_name_str |
Repositório de Dados de Pesquisa da EMBRAPA (Redape) |
repository_id_str |
|
spelling |
https://doi.org/10.48432/1YYF9YParreiras, Taya CristoBolfe, Édson LuisSilva, Lucas Augusto Pereira daSano, Edson EyjiBettiol, Giovana MaranhãoMapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2.RedapeThis dataset is related to the paper "Mapping Agricultural Intensification in the Brazilian Savanna: A Machine Learning Approach and Harmonized Data from Landsat Sentinel-2". The study aimed to analyze the performance of the machine learning algorithms Random Forest (RF), Artificial Neural Networks (ANN), and Extreme Gradient Boosting (XGBoost), fed with the time-series of spectral indices NDVI, NDWI, and SAVI from NASA Harmonized Landsat Sentinel-2 (HLS), in detecting intensification (number of cycles) and crop types in Sorriso municipality, Mato Grosso State, in the 2021-2022 crop season, using hierarchical classification in three levels. At Level 1, the target classes were temporary crops (1), native vegetation and silviculture (2), and pastures (3). At Level 2, double cropping (1), single cropping (2), and triple cropping (3). At Level 3, the aim was to identify the second-season crops cultivated in areas identified as double cropping: beans (1), corn (2), cotton (3), and other crops (4). The files available in this dataset are: - Vector files, in shapefiles format, with ground samples obtained during fieldwork in Sorriso, Mato Grosso, between 6-9 June 2022. The files are compressed by level, with the names "Samples_LevelX.zip" in the "Vector" folder. - Worksheets for modeling, in xlsx format, containing the values of the time series of each spectral index, at each classification level, for each sampling point. The files are named "DB_index_LevelX.xlsx" (e.g., "DB_NDVI_Level1"). There is also a PDF file (Order_of_Layers.pdf) to identify the explanatory variables according to the layer order of the original raster stack (e.g., "NDVI_1" is NDVI from September 3rd, 2021). These files are in the "Dataset" folder with subfolders named by level (e.g., "Level_1"). - The R scripts for running the models, getting confusion matrices, and accuracy metrics. The files are named "ALGORITHM_LevelX.R" (e.g., "ANN_Level1.R" or "RF_Level2.R"). In each script, all the modeling processes of all spectral indices are present. For example, the file "ANN_Level1.R" contains the models with the variables NDVI, SAVI, NDWI, and the three combined (AllVI). - The results of each model, in 'rds' format (use R to read it). The files are named "ALGORITHM_index_model_LevelX.rds" (e.g., "XGBoost_NDVI_model_Level2.rds") and allocated in the "Results" folder. - The 27 final maps resulted from spatial predictions in TIFF format (e.g., "Map_ANN_NDVI_Level3_Final.tif"). The files are in the "Final_Maps" folder. Each file contains a brief description, and we encourage users to read the associated paper for further processing details.2023-07-04info:eu-repo/semantics/openAccesshttps://www.redape.dados.embrapa.br/licenses/embrapa-by-nc-4.0.xhtmlEarth and Environmental SciencesOtherMultisensorHLSAgricultureRemote SensingCerradoinfo:eu-repo/semantics/datasetinfo:eu-repo/semantics/datasetinfo:eu-repo/semantics/publishedVersionDatasetreponame:Repositório de Dados de Pesquisa da EMBRAPA (Redape)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPARepositório de Dados de PesquisaPUBhttps://www.redape.dados.embrapa.br/oaiopendoar:2024-03-20T09:48:51.994612Repositório de Dados de Pesquisa da EMBRAPA (Redape) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)falsedoi:10.48432/1YYF9Y |
dc.title.none.fl_str_mv |
Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2. |
title |
Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2. |
spellingShingle |
Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2. Parreiras, Taya Cristo Earth and Environmental Sciences Other Multisensor HLS Agriculture Remote Sensing Cerrado |
title_short |
Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2. |
title_full |
Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2. |
title_fullStr |
Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2. |
title_full_unstemmed |
Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2. |
title_sort |
Mapping agricultural intensification in the Brazilian Savanna: a machine learning approach and harmonized data from Landsat Sentinel-2. |
author |
Parreiras, Taya Cristo |
author_facet |
Parreiras, Taya Cristo Bolfe, Édson Luis Silva, Lucas Augusto Pereira da Sano, Edson Eyji Bettiol, Giovana Maranhão |
author_role |
author |
author2 |
Bolfe, Édson Luis Silva, Lucas Augusto Pereira da Sano, Edson Eyji Bettiol, Giovana Maranhão |
author2_role |
author author author author |
dc.contributor.author.fl_str_mv |
Parreiras, Taya Cristo Bolfe, Édson Luis Silva, Lucas Augusto Pereira da Sano, Edson Eyji Bettiol, Giovana Maranhão |
dc.subject.none.fl_str_mv |
Earth and Environmental Sciences Other Multisensor HLS Agriculture Remote Sensing Cerrado |
topic |
Earth and Environmental Sciences Other Multisensor HLS Agriculture Remote Sensing Cerrado |
description |
This dataset is related to the paper "Mapping Agricultural Intensification in the Brazilian Savanna: A Machine Learning Approach and Harmonized Data from Landsat Sentinel-2". The study aimed to analyze the performance of the machine learning algorithms Random Forest (RF), Artificial Neural Networks (ANN), and Extreme Gradient Boosting (XGBoost), fed with the time-series of spectral indices NDVI, NDWI, and SAVI from NASA Harmonized Landsat Sentinel-2 (HLS), in detecting intensification (number of cycles) and crop types in Sorriso municipality, Mato Grosso State, in the 2021-2022 crop season, using hierarchical classification in three levels. At Level 1, the target classes were temporary crops (1), native vegetation and silviculture (2), and pastures (3). At Level 2, double cropping (1), single cropping (2), and triple cropping (3). At Level 3, the aim was to identify the second-season crops cultivated in areas identified as double cropping: beans (1), corn (2), cotton (3), and other crops (4). The files available in this dataset are: - Vector files, in shapefiles format, with ground samples obtained during fieldwork in Sorriso, Mato Grosso, between 6-9 June 2022. The files are compressed by level, with the names "Samples_LevelX.zip" in the "Vector" folder. - Worksheets for modeling, in xlsx format, containing the values of the time series of each spectral index, at each classification level, for each sampling point. The files are named "DB_index_LevelX.xlsx" (e.g., "DB_NDVI_Level1"). There is also a PDF file (Order_of_Layers.pdf) to identify the explanatory variables according to the layer order of the original raster stack (e.g., "NDVI_1" is NDVI from September 3rd, 2021). These files are in the "Dataset" folder with subfolders named by level (e.g., "Level_1"). - The R scripts for running the models, getting confusion matrices, and accuracy metrics. The files are named "ALGORITHM_LevelX.R" (e.g., "ANN_Level1.R" or "RF_Level2.R"). In each script, all the modeling processes of all spectral indices are present. For example, the file "ANN_Level1.R" contains the models with the variables NDVI, SAVI, NDWI, and the three combined (AllVI). - The results of each model, in 'rds' format (use R to read it). The files are named "ALGORITHM_index_model_LevelX.rds" (e.g., "XGBoost_NDVI_model_Level2.rds") and allocated in the "Results" folder. - The 27 final maps resulted from spatial predictions in TIFF format (e.g., "Map_ANN_NDVI_Level3_Final.tif"). The files are in the "Final_Maps" folder. Each file contains a brief description, and we encourage users to read the associated paper for further processing details. |
publishDate |
2023 |
dc.date.issued.fl_str_mv |
2023-07-04 |
dc.type.openaire.fl_str_mv |
info:eu-repo/semantics/dataset |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.none.fl_str_mv |
info:eu-repo/semantics/dataset |
format |
dataset |
status_str |
publishedVersion |
dc.identifier.url.fl_str_mv |
https://doi.org/10.48432/1YYF9Y |
url |
https://doi.org/10.48432/1YYF9Y |
dc.rights.openaire.fl_str_mv |
info:eu-repo/semantics/openAccess https://www.redape.dados.embrapa.br/licenses/embrapa-by-nc-4.0.xhtml |
eu_rights_str_mv |
openAccess |
rights_invalid_str_mv |
https://www.redape.dados.embrapa.br/licenses/embrapa-by-nc-4.0.xhtml |
dc.format.none.fl_str_mv |
Dataset |
dc.publisher.none.fl_str_mv |
Redape |
publisher.none.fl_str_mv |
Redape |
dc.source.none.fl_str_mv |
reponame:Repositório de Dados de Pesquisa da EMBRAPA (Redape) instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa) instacron:EMBRAPA |
instname_str |
Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
instacron_str |
EMBRAPA |
institution |
EMBRAPA |
reponame_str |
Repositório de Dados de Pesquisa da EMBRAPA (Redape) |
collection |
Repositório de Dados de Pesquisa da EMBRAPA (Redape) |
repository.name.fl_str_mv |
Repositório de Dados de Pesquisa da EMBRAPA (Redape) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
repository.mail.fl_str_mv |
|
_version_ |
1808846633171943424 |