Preprocessing procedures and supervised classification applied to a database of systematic soil survey.
Autor(a) principal: | |
---|---|
Data de Publicação: | 2019 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
Texto Completo: | http://www.alice.cnptia.embrapa.br/alice/handle/doc/1118563 http://dx.doi.org/10.1590/1678-992X-2017-0171 |
Resumo: | ABSTRACT:Data Mining techniques play an important role in the prediction of soil spatial distribution in systematic soil surveying, though existing methodologies still lack standardization and a full understanding of their capabilities. The aim of this work was to evaluate the performance of preprocessing procedures and supervised classification approaches for predicting map units from 1:100,000-scale conventional semi-detailed soil surveys. Sheets of the Brazilian National Cartographic System on the 1:50,000 scale, ?Dois Córregos? (?Brotas? 1:100,000-scale sheet), ?São Pedro? and ?Laras? (?Piracicaba? 1:100,000-scale sheet) were used for developing models. Soil map information and predictive environmental covariates for the dataset were obtained from the semi-detailed soil survey of the state of São Paulo, from the Brazilian Institute of Geography and Statistics (IBGE) 1:50,000-scale topographic sheets and from the 1:750,000-scale geological map of the state of São Paulo. The target variable was a soil map unit of four types: local ?soil unit? name and soil class at three hierarchical levels of the Brazilian System of Soil Classification (SiBCS). Different data preprocessing treatments and four algorithms all having different approaches were also tested. Results showed that composite soil map units were not adequate for the machine learning process. Class balance did not contribute to improving the performance of classifiers. Accuracy values of 78 % and a Kappa index of 0.67 were obtained after preprocessing procedures with Random Forest, the algorithm that performed best. Information from conventional map units of semi-detailed (4th order) 1:100,000 soil survey generated models with values for accuracy, precision, sensitivity, specificity and Kappa indexes that support their use in programs for systematic soil surveying. |
id |
EMBR_5a07770d89c058064e55efc298a25728 |
---|---|
oai_identifier_str |
oai:www.alice.cnptia.embrapa.br:doc/1118563 |
network_acronym_str |
EMBR |
network_name_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository_id_str |
2154 |
spelling |
Preprocessing procedures and supervised classification applied to a database of systematic soil survey.Aprendizado de máquinaPré-processamentoClassificação de solosRandom forestMachine learning algorithmsTacit soil-landscape relationshipsDigital soil mappingSoloSoilSoil classificationABSTRACT:Data Mining techniques play an important role in the prediction of soil spatial distribution in systematic soil surveying, though existing methodologies still lack standardization and a full understanding of their capabilities. The aim of this work was to evaluate the performance of preprocessing procedures and supervised classification approaches for predicting map units from 1:100,000-scale conventional semi-detailed soil surveys. Sheets of the Brazilian National Cartographic System on the 1:50,000 scale, ?Dois Córregos? (?Brotas? 1:100,000-scale sheet), ?São Pedro? and ?Laras? (?Piracicaba? 1:100,000-scale sheet) were used for developing models. Soil map information and predictive environmental covariates for the dataset were obtained from the semi-detailed soil survey of the state of São Paulo, from the Brazilian Institute of Geography and Statistics (IBGE) 1:50,000-scale topographic sheets and from the 1:750,000-scale geological map of the state of São Paulo. The target variable was a soil map unit of four types: local ?soil unit? name and soil class at three hierarchical levels of the Brazilian System of Soil Classification (SiBCS). Different data preprocessing treatments and four algorithms all having different approaches were also tested. Results showed that composite soil map units were not adequate for the machine learning process. Class balance did not contribute to improving the performance of classifiers. Accuracy values of 78 % and a Kappa index of 0.67 were obtained after preprocessing procedures with Random Forest, the algorithm that performed best. Information from conventional map units of semi-detailed (4th order) 1:100,000 soil survey generated models with values for accuracy, precision, sensitivity, specificity and Kappa indexes that support their use in programs for systematic soil surveying.ALAN PESSOA VALADARES, IAC; RICARDO MARQUES COELHO, IAC; STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA.VALADARES, A. P.COELHO, R. M.OLIVEIRA, S. R. de M.2020-01-11T00:41:14Z2020-01-11T00:41:14Z2020-01-1020192020-01-16T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleScientia Agricola, v. 76, n. 5, p. 439-447, Sept./Oct. 2019.http://www.alice.cnptia.embrapa.br/alice/handle/doc/1118563http://dx.doi.org/10.1590/1678-992X-2017-0171enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2020-01-11T00:41:20Zoai:www.alice.cnptia.embrapa.br:doc/1118563Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542020-01-11T00:41:20Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false |
dc.title.none.fl_str_mv |
Preprocessing procedures and supervised classification applied to a database of systematic soil survey. |
title |
Preprocessing procedures and supervised classification applied to a database of systematic soil survey. |
spellingShingle |
Preprocessing procedures and supervised classification applied to a database of systematic soil survey. VALADARES, A. P. Aprendizado de máquina Pré-processamento Classificação de solos Random forest Machine learning algorithms Tacit soil-landscape relationships Digital soil mapping Solo Soil Soil classification |
title_short |
Preprocessing procedures and supervised classification applied to a database of systematic soil survey. |
title_full |
Preprocessing procedures and supervised classification applied to a database of systematic soil survey. |
title_fullStr |
Preprocessing procedures and supervised classification applied to a database of systematic soil survey. |
title_full_unstemmed |
Preprocessing procedures and supervised classification applied to a database of systematic soil survey. |
title_sort |
Preprocessing procedures and supervised classification applied to a database of systematic soil survey. |
author |
VALADARES, A. P. |
author_facet |
VALADARES, A. P. COELHO, R. M. OLIVEIRA, S. R. de M. |
author_role |
author |
author2 |
COELHO, R. M. OLIVEIRA, S. R. de M. |
author2_role |
author author |
dc.contributor.none.fl_str_mv |
ALAN PESSOA VALADARES, IAC; RICARDO MARQUES COELHO, IAC; STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA. |
dc.contributor.author.fl_str_mv |
VALADARES, A. P. COELHO, R. M. OLIVEIRA, S. R. de M. |
dc.subject.por.fl_str_mv |
Aprendizado de máquina Pré-processamento Classificação de solos Random forest Machine learning algorithms Tacit soil-landscape relationships Digital soil mapping Solo Soil Soil classification |
topic |
Aprendizado de máquina Pré-processamento Classificação de solos Random forest Machine learning algorithms Tacit soil-landscape relationships Digital soil mapping Solo Soil Soil classification |
description |
ABSTRACT:Data Mining techniques play an important role in the prediction of soil spatial distribution in systematic soil surveying, though existing methodologies still lack standardization and a full understanding of their capabilities. The aim of this work was to evaluate the performance of preprocessing procedures and supervised classification approaches for predicting map units from 1:100,000-scale conventional semi-detailed soil surveys. Sheets of the Brazilian National Cartographic System on the 1:50,000 scale, ?Dois Córregos? (?Brotas? 1:100,000-scale sheet), ?São Pedro? and ?Laras? (?Piracicaba? 1:100,000-scale sheet) were used for developing models. Soil map information and predictive environmental covariates for the dataset were obtained from the semi-detailed soil survey of the state of São Paulo, from the Brazilian Institute of Geography and Statistics (IBGE) 1:50,000-scale topographic sheets and from the 1:750,000-scale geological map of the state of São Paulo. The target variable was a soil map unit of four types: local ?soil unit? name and soil class at three hierarchical levels of the Brazilian System of Soil Classification (SiBCS). Different data preprocessing treatments and four algorithms all having different approaches were also tested. Results showed that composite soil map units were not adequate for the machine learning process. Class balance did not contribute to improving the performance of classifiers. Accuracy values of 78 % and a Kappa index of 0.67 were obtained after preprocessing procedures with Random Forest, the algorithm that performed best. Information from conventional map units of semi-detailed (4th order) 1:100,000 soil survey generated models with values for accuracy, precision, sensitivity, specificity and Kappa indexes that support their use in programs for systematic soil surveying. |
publishDate |
2019 |
dc.date.none.fl_str_mv |
2019 2020-01-11T00:41:14Z 2020-01-11T00:41:14Z 2020-01-10 2020-01-16T11:11:11Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
Scientia Agricola, v. 76, n. 5, p. 439-447, Sept./Oct. 2019. http://www.alice.cnptia.embrapa.br/alice/handle/doc/1118563 http://dx.doi.org/10.1590/1678-992X-2017-0171 |
identifier_str_mv |
Scientia Agricola, v. 76, n. 5, p. 439-447, Sept./Oct. 2019. |
url |
http://www.alice.cnptia.embrapa.br/alice/handle/doc/1118563 http://dx.doi.org/10.1590/1678-992X-2017-0171 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa) instacron:EMBRAPA |
instname_str |
Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
instacron_str |
EMBRAPA |
institution |
EMBRAPA |
reponame_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
collection |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository.name.fl_str_mv |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
repository.mail.fl_str_mv |
cg-riaa@embrapa.br |
_version_ |
1822721450515103744 |