Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning

Detalhes bibliográficos
Autor(a) principal: Mohsine, Ismail
Data de Publicação: 2023
Outros Autores: Kacimi, Ilias, Valles, Vincent, Leblanc, Marc, El Mahrad, Badr, Dassonville, Fabrice, Kassou, Nadia, Bouramtane, Tarik, Abraham, Shiny, Touiouine, Abdessamad, Jabrane, Meryem, Touzani, Meryem, Barry, Abdoul Azize, Yameogo, Suzanne, Barbiero, Laurent
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.1/20311
Resumo: In order to facilitate the monitoring of groundwater quality in France, the groundwater bodies (GWB) in the Provence-Alpes-Cote d'Azur region have been grouped into 11 homogeneous clusters on the basis of their physico-chemical and bacteriological characteristics. This study aims to test the legitimacy of this grouping by predicting whether water samples belong to a given sampling point, GWB or group of GWBs. To this end, 8673 observations and 18 parameters were extracted from the Size-Eaux database, and this dataset was processed using discriminant analysis and various machine learning algorithms. The results indicate an accuracy of 67% using linear discriminant analysis and 69 to 83% using ML algorithms, while quadratic discriminant analysis underperforms in comparison, yielding a less accurate prediction of 59%. The importance of each parameter in the prediction was assessed using an approach combining recursive feature elimination (RFE) techniques and random forest feature importance (RFFI). Major ions show high spatial range and play the main role in discrimination, while trace elements and bacteriological parameters of high local and/or temporal variability only play a minor role. The disparity of the results according to the characteristics of the GWB groups (geography, altitude, lithology, etc.) is discussed. Validating the grouping of GWBs will enable monitoring and surveillance strategies to be redirected on the basis of fewer, homogeneous hydrogeological units, in order to optimize sustainable management of the resource by the health agencies.
id RCAP_ccb0938ff867f379a4f6f043304ed69f
oai_identifier_str oai:sapientia.ualg.pt:10400.1/20311
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine LearningGroundwater bodiesMachine learningDiscriminant analysisChemical compositionBacteriological compositionPACA regionFranceIn order to facilitate the monitoring of groundwater quality in France, the groundwater bodies (GWB) in the Provence-Alpes-Cote d'Azur region have been grouped into 11 homogeneous clusters on the basis of their physico-chemical and bacteriological characteristics. This study aims to test the legitimacy of this grouping by predicting whether water samples belong to a given sampling point, GWB or group of GWBs. To this end, 8673 observations and 18 parameters were extracted from the Size-Eaux database, and this dataset was processed using discriminant analysis and various machine learning algorithms. The results indicate an accuracy of 67% using linear discriminant analysis and 69 to 83% using ML algorithms, while quadratic discriminant analysis underperforms in comparison, yielding a less accurate prediction of 59%. The importance of each parameter in the prediction was assessed using an approach combining recursive feature elimination (RFE) techniques and random forest feature importance (RFFI). Major ions show high spatial range and play the main role in discrimination, while trace elements and bacteriological parameters of high local and/or temporal variability only play a minor role. The disparity of the results according to the characteristics of the GWB groups (geography, altitude, lithology, etc.) is discussed. Validating the grouping of GWBs will enable monitoring and surveillance strategies to be redirected on the basis of fewer, homogeneous hydrogeological units, in order to optimize sustainable management of the resource by the health agencies.MDPISapientiaMohsine, IsmailKacimi, IliasValles, VincentLeblanc, MarcEl Mahrad, BadrDassonville, FabriceKassou, NadiaBouramtane, TarikAbraham, ShinyTouiouine, AbdessamadJabrane, MeryemTouzani, MeryemBarry, Abdoul AzizeYameogo, SuzanneBarbiero, Laurent2024-01-18T10:55:28Z20232023-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.1/20311eng10.3390/hydrology101202302306-5338info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-24T02:00:52Zoai:sapientia.ualg.pt:10400.1/20311Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:56:49.412461Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning
title Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning
spellingShingle Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning
Mohsine, Ismail
Groundwater bodies
Machine learning
Discriminant analysis
Chemical composition
Bacteriological composition
PACA region
France
title_short Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning
title_full Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning
title_fullStr Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning
title_full_unstemmed Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning
title_sort Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning
author Mohsine, Ismail
author_facet Mohsine, Ismail
Kacimi, Ilias
Valles, Vincent
Leblanc, Marc
El Mahrad, Badr
Dassonville, Fabrice
Kassou, Nadia
Bouramtane, Tarik
Abraham, Shiny
Touiouine, Abdessamad
Jabrane, Meryem
Touzani, Meryem
Barry, Abdoul Azize
Yameogo, Suzanne
Barbiero, Laurent
author_role author
author2 Kacimi, Ilias
Valles, Vincent
Leblanc, Marc
El Mahrad, Badr
Dassonville, Fabrice
Kassou, Nadia
Bouramtane, Tarik
Abraham, Shiny
Touiouine, Abdessamad
Jabrane, Meryem
Touzani, Meryem
Barry, Abdoul Azize
Yameogo, Suzanne
Barbiero, Laurent
author2_role author
author
author
author
author
author
author
author
author
author
author
author
author
author
dc.contributor.none.fl_str_mv Sapientia
dc.contributor.author.fl_str_mv Mohsine, Ismail
Kacimi, Ilias
Valles, Vincent
Leblanc, Marc
El Mahrad, Badr
Dassonville, Fabrice
Kassou, Nadia
Bouramtane, Tarik
Abraham, Shiny
Touiouine, Abdessamad
Jabrane, Meryem
Touzani, Meryem
Barry, Abdoul Azize
Yameogo, Suzanne
Barbiero, Laurent
dc.subject.por.fl_str_mv Groundwater bodies
Machine learning
Discriminant analysis
Chemical composition
Bacteriological composition
PACA region
France
topic Groundwater bodies
Machine learning
Discriminant analysis
Chemical composition
Bacteriological composition
PACA region
France
description In order to facilitate the monitoring of groundwater quality in France, the groundwater bodies (GWB) in the Provence-Alpes-Cote d'Azur region have been grouped into 11 homogeneous clusters on the basis of their physico-chemical and bacteriological characteristics. This study aims to test the legitimacy of this grouping by predicting whether water samples belong to a given sampling point, GWB or group of GWBs. To this end, 8673 observations and 18 parameters were extracted from the Size-Eaux database, and this dataset was processed using discriminant analysis and various machine learning algorithms. The results indicate an accuracy of 67% using linear discriminant analysis and 69 to 83% using ML algorithms, while quadratic discriminant analysis underperforms in comparison, yielding a less accurate prediction of 59%. The importance of each parameter in the prediction was assessed using an approach combining recursive feature elimination (RFE) techniques and random forest feature importance (RFFI). Major ions show high spatial range and play the main role in discrimination, while trace elements and bacteriological parameters of high local and/or temporal variability only play a minor role. The disparity of the results according to the characteristics of the GWB groups (geography, altitude, lithology, etc.) is discussed. Validating the grouping of GWBs will enable monitoring and surveillance strategies to be redirected on the basis of fewer, homogeneous hydrogeological units, in order to optimize sustainable management of the resource by the health agencies.
publishDate 2023
dc.date.none.fl_str_mv 2023
2023-01-01T00:00:00Z
2024-01-18T10:55:28Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.1/20311
url http://hdl.handle.net/10400.1/20311
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.3390/hydrology10120230
2306-5338
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv MDPI
publisher.none.fl_str_mv MDPI
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799137056129024000