Preprocessing procedures and supervised classification applied to a database of systematic soil survey.

Detalhes bibliográficos
Autor(a) principal: VALADARES, A. P.
Data de Publicação: 2019
Outros Autores: COELHO, R. M., OLIVEIRA, S. R. de M.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Texto Completo: http://www.alice.cnptia.embrapa.br/alice/handle/doc/1118563
http://dx.doi.org/10.1590/1678-992X-2017-0171
Resumo: ABSTRACT:Data Mining techniques play an important role in the prediction of soil spatial distribution in systematic soil surveying, though existing methodologies still lack standardization and a full understanding of their capabilities. The aim of this work was to evaluate the performance of preprocessing procedures and supervised classification approaches for predicting map units from 1:100,000-scale conventional semi-detailed soil surveys. Sheets of the Brazilian National Cartographic System on the 1:50,000 scale, ?Dois Córregos? (?Brotas? 1:100,000-scale sheet), ?São Pedro? and ?Laras? (?Piracicaba? 1:100,000-scale sheet) were used for developing models. Soil map information and predictive environmental covariates for the dataset were obtained from the semi-detailed soil survey of the state of São Paulo, from the Brazilian Institute of Geography and Statistics (IBGE) 1:50,000-scale topographic sheets and from the 1:750,000-scale geological map of the state of São Paulo. The target variable was a soil map unit of four types: local ?soil unit? name and soil class at three hierarchical levels of the Brazilian System of Soil Classification (SiBCS). Different data preprocessing treatments and four algorithms all having different approaches were also tested. Results showed that composite soil map units were not adequate for the machine learning process. Class balance did not contribute to improving the performance of classifiers. Accuracy values of 78 % and a Kappa index of 0.67 were obtained after preprocessing procedures with Random Forest, the algorithm that performed best. Information from conventional map units of semi-detailed (4th order) 1:100,000 soil survey generated models with values for accuracy, precision, sensitivity, specificity and Kappa indexes that support their use in programs for systematic soil surveying.
id EMBR_5a07770d89c058064e55efc298a25728
oai_identifier_str oai:www.alice.cnptia.embrapa.br:doc/1118563
network_acronym_str EMBR
network_name_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str 2154
spelling Preprocessing procedures and supervised classification applied to a database of systematic soil survey.Aprendizado de máquinaPré-processamentoClassificação de solosRandom forestMachine learning algorithmsTacit soil-landscape relationshipsDigital soil mappingSoloSoilSoil classificationABSTRACT:Data Mining techniques play an important role in the prediction of soil spatial distribution in systematic soil surveying, though existing methodologies still lack standardization and a full understanding of their capabilities. The aim of this work was to evaluate the performance of preprocessing procedures and supervised classification approaches for predicting map units from 1:100,000-scale conventional semi-detailed soil surveys. Sheets of the Brazilian National Cartographic System on the 1:50,000 scale, ?Dois Córregos? (?Brotas? 1:100,000-scale sheet), ?São Pedro? and ?Laras? (?Piracicaba? 1:100,000-scale sheet) were used for developing models. Soil map information and predictive environmental covariates for the dataset were obtained from the semi-detailed soil survey of the state of São Paulo, from the Brazilian Institute of Geography and Statistics (IBGE) 1:50,000-scale topographic sheets and from the 1:750,000-scale geological map of the state of São Paulo. The target variable was a soil map unit of four types: local ?soil unit? name and soil class at three hierarchical levels of the Brazilian System of Soil Classification (SiBCS). Different data preprocessing treatments and four algorithms all having different approaches were also tested. Results showed that composite soil map units were not adequate for the machine learning process. Class balance did not contribute to improving the performance of classifiers. Accuracy values of 78 % and a Kappa index of 0.67 were obtained after preprocessing procedures with Random Forest, the algorithm that performed best. Information from conventional map units of semi-detailed (4th order) 1:100,000 soil survey generated models with values for accuracy, precision, sensitivity, specificity and Kappa indexes that support their use in programs for systematic soil surveying.ALAN PESSOA VALADARES, IAC; RICARDO MARQUES COELHO, IAC; STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA.VALADARES, A. P.COELHO, R. M.OLIVEIRA, S. R. de M.2020-01-11T00:41:14Z2020-01-11T00:41:14Z2020-01-1020192020-01-16T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleScientia Agricola, v. 76, n. 5, p. 439-447, Sept./Oct. 2019.http://www.alice.cnptia.embrapa.br/alice/handle/doc/1118563http://dx.doi.org/10.1590/1678-992X-2017-0171enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2020-01-11T00:41:20Zoai:www.alice.cnptia.embrapa.br:doc/1118563Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542020-01-11T00:41:20Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false
dc.title.none.fl_str_mv Preprocessing procedures and supervised classification applied to a database of systematic soil survey.
title Preprocessing procedures and supervised classification applied to a database of systematic soil survey.
spellingShingle Preprocessing procedures and supervised classification applied to a database of systematic soil survey.
VALADARES, A. P.
Aprendizado de máquina
Pré-processamento
Classificação de solos
Random forest
Machine learning algorithms
Tacit soil-landscape relationships
Digital soil mapping
Solo
Soil
Soil classification
title_short Preprocessing procedures and supervised classification applied to a database of systematic soil survey.
title_full Preprocessing procedures and supervised classification applied to a database of systematic soil survey.
title_fullStr Preprocessing procedures and supervised classification applied to a database of systematic soil survey.
title_full_unstemmed Preprocessing procedures and supervised classification applied to a database of systematic soil survey.
title_sort Preprocessing procedures and supervised classification applied to a database of systematic soil survey.
author VALADARES, A. P.
author_facet VALADARES, A. P.
COELHO, R. M.
OLIVEIRA, S. R. de M.
author_role author
author2 COELHO, R. M.
OLIVEIRA, S. R. de M.
author2_role author
author
dc.contributor.none.fl_str_mv ALAN PESSOA VALADARES, IAC; RICARDO MARQUES COELHO, IAC; STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA.
dc.contributor.author.fl_str_mv VALADARES, A. P.
COELHO, R. M.
OLIVEIRA, S. R. de M.
dc.subject.por.fl_str_mv Aprendizado de máquina
Pré-processamento
Classificação de solos
Random forest
Machine learning algorithms
Tacit soil-landscape relationships
Digital soil mapping
Solo
Soil
Soil classification
topic Aprendizado de máquina
Pré-processamento
Classificação de solos
Random forest
Machine learning algorithms
Tacit soil-landscape relationships
Digital soil mapping
Solo
Soil
Soil classification
description ABSTRACT:Data Mining techniques play an important role in the prediction of soil spatial distribution in systematic soil surveying, though existing methodologies still lack standardization and a full understanding of their capabilities. The aim of this work was to evaluate the performance of preprocessing procedures and supervised classification approaches for predicting map units from 1:100,000-scale conventional semi-detailed soil surveys. Sheets of the Brazilian National Cartographic System on the 1:50,000 scale, ?Dois Córregos? (?Brotas? 1:100,000-scale sheet), ?São Pedro? and ?Laras? (?Piracicaba? 1:100,000-scale sheet) were used for developing models. Soil map information and predictive environmental covariates for the dataset were obtained from the semi-detailed soil survey of the state of São Paulo, from the Brazilian Institute of Geography and Statistics (IBGE) 1:50,000-scale topographic sheets and from the 1:750,000-scale geological map of the state of São Paulo. The target variable was a soil map unit of four types: local ?soil unit? name and soil class at three hierarchical levels of the Brazilian System of Soil Classification (SiBCS). Different data preprocessing treatments and four algorithms all having different approaches were also tested. Results showed that composite soil map units were not adequate for the machine learning process. Class balance did not contribute to improving the performance of classifiers. Accuracy values of 78 % and a Kappa index of 0.67 were obtained after preprocessing procedures with Random Forest, the algorithm that performed best. Information from conventional map units of semi-detailed (4th order) 1:100,000 soil survey generated models with values for accuracy, precision, sensitivity, specificity and Kappa indexes that support their use in programs for systematic soil surveying.
publishDate 2019
dc.date.none.fl_str_mv 2019
2020-01-11T00:41:14Z
2020-01-11T00:41:14Z
2020-01-10
2020-01-16T11:11:11Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv Scientia Agricola, v. 76, n. 5, p. 439-447, Sept./Oct. 2019.
http://www.alice.cnptia.embrapa.br/alice/handle/doc/1118563
http://dx.doi.org/10.1590/1678-992X-2017-0171
identifier_str_mv Scientia Agricola, v. 76, n. 5, p. 439-447, Sept./Oct. 2019.
url http://www.alice.cnptia.embrapa.br/alice/handle/doc/1118563
http://dx.doi.org/10.1590/1678-992X-2017-0171
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron:EMBRAPA
instname_str Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str EMBRAPA
institution EMBRAPA
reponame_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv cg-riaa@embrapa.br
_version_ 1822721450515103744