Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning
Autor(a) principal: | |
---|---|
Data de Publicação: | 2023 |
Outros Autores: | , , , , , , , , , , , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.1/20311 |
Resumo: | In order to facilitate the monitoring of groundwater quality in France, the groundwater bodies (GWB) in the Provence-Alpes-Cote d'Azur region have been grouped into 11 homogeneous clusters on the basis of their physico-chemical and bacteriological characteristics. This study aims to test the legitimacy of this grouping by predicting whether water samples belong to a given sampling point, GWB or group of GWBs. To this end, 8673 observations and 18 parameters were extracted from the Size-Eaux database, and this dataset was processed using discriminant analysis and various machine learning algorithms. The results indicate an accuracy of 67% using linear discriminant analysis and 69 to 83% using ML algorithms, while quadratic discriminant analysis underperforms in comparison, yielding a less accurate prediction of 59%. The importance of each parameter in the prediction was assessed using an approach combining recursive feature elimination (RFE) techniques and random forest feature importance (RFFI). Major ions show high spatial range and play the main role in discrimination, while trace elements and bacteriological parameters of high local and/or temporal variability only play a minor role. The disparity of the results according to the characteristics of the GWB groups (geography, altitude, lithology, etc.) is discussed. Validating the grouping of GWBs will enable monitoring and surveillance strategies to be redirected on the basis of fewer, homogeneous hydrogeological units, in order to optimize sustainable management of the resource by the health agencies. |
id |
RCAP_ccb0938ff867f379a4f6f043304ed69f |
---|---|
oai_identifier_str |
oai:sapientia.ualg.pt:10400.1/20311 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine LearningGroundwater bodiesMachine learningDiscriminant analysisChemical compositionBacteriological compositionPACA regionFranceIn order to facilitate the monitoring of groundwater quality in France, the groundwater bodies (GWB) in the Provence-Alpes-Cote d'Azur region have been grouped into 11 homogeneous clusters on the basis of their physico-chemical and bacteriological characteristics. This study aims to test the legitimacy of this grouping by predicting whether water samples belong to a given sampling point, GWB or group of GWBs. To this end, 8673 observations and 18 parameters were extracted from the Size-Eaux database, and this dataset was processed using discriminant analysis and various machine learning algorithms. The results indicate an accuracy of 67% using linear discriminant analysis and 69 to 83% using ML algorithms, while quadratic discriminant analysis underperforms in comparison, yielding a less accurate prediction of 59%. The importance of each parameter in the prediction was assessed using an approach combining recursive feature elimination (RFE) techniques and random forest feature importance (RFFI). Major ions show high spatial range and play the main role in discrimination, while trace elements and bacteriological parameters of high local and/or temporal variability only play a minor role. The disparity of the results according to the characteristics of the GWB groups (geography, altitude, lithology, etc.) is discussed. Validating the grouping of GWBs will enable monitoring and surveillance strategies to be redirected on the basis of fewer, homogeneous hydrogeological units, in order to optimize sustainable management of the resource by the health agencies.MDPISapientiaMohsine, IsmailKacimi, IliasValles, VincentLeblanc, MarcEl Mahrad, BadrDassonville, FabriceKassou, NadiaBouramtane, TarikAbraham, ShinyTouiouine, AbdessamadJabrane, MeryemTouzani, MeryemBarry, Abdoul AzizeYameogo, SuzanneBarbiero, Laurent2024-01-18T10:55:28Z20232023-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.1/20311eng10.3390/hydrology101202302306-5338info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-24T02:00:52Zoai:sapientia.ualg.pt:10400.1/20311Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:56:49.412461Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning |
title |
Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning |
spellingShingle |
Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning Mohsine, Ismail Groundwater bodies Machine learning Discriminant analysis Chemical composition Bacteriological composition PACA region France |
title_short |
Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning |
title_full |
Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning |
title_fullStr |
Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning |
title_full_unstemmed |
Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning |
title_sort |
Differentiation of multi-parametric groups of groundwater bodies through discriminant analysis and machine Learning |
author |
Mohsine, Ismail |
author_facet |
Mohsine, Ismail Kacimi, Ilias Valles, Vincent Leblanc, Marc El Mahrad, Badr Dassonville, Fabrice Kassou, Nadia Bouramtane, Tarik Abraham, Shiny Touiouine, Abdessamad Jabrane, Meryem Touzani, Meryem Barry, Abdoul Azize Yameogo, Suzanne Barbiero, Laurent |
author_role |
author |
author2 |
Kacimi, Ilias Valles, Vincent Leblanc, Marc El Mahrad, Badr Dassonville, Fabrice Kassou, Nadia Bouramtane, Tarik Abraham, Shiny Touiouine, Abdessamad Jabrane, Meryem Touzani, Meryem Barry, Abdoul Azize Yameogo, Suzanne Barbiero, Laurent |
author2_role |
author author author author author author author author author author author author author author |
dc.contributor.none.fl_str_mv |
Sapientia |
dc.contributor.author.fl_str_mv |
Mohsine, Ismail Kacimi, Ilias Valles, Vincent Leblanc, Marc El Mahrad, Badr Dassonville, Fabrice Kassou, Nadia Bouramtane, Tarik Abraham, Shiny Touiouine, Abdessamad Jabrane, Meryem Touzani, Meryem Barry, Abdoul Azize Yameogo, Suzanne Barbiero, Laurent |
dc.subject.por.fl_str_mv |
Groundwater bodies Machine learning Discriminant analysis Chemical composition Bacteriological composition PACA region France |
topic |
Groundwater bodies Machine learning Discriminant analysis Chemical composition Bacteriological composition PACA region France |
description |
In order to facilitate the monitoring of groundwater quality in France, the groundwater bodies (GWB) in the Provence-Alpes-Cote d'Azur region have been grouped into 11 homogeneous clusters on the basis of their physico-chemical and bacteriological characteristics. This study aims to test the legitimacy of this grouping by predicting whether water samples belong to a given sampling point, GWB or group of GWBs. To this end, 8673 observations and 18 parameters were extracted from the Size-Eaux database, and this dataset was processed using discriminant analysis and various machine learning algorithms. The results indicate an accuracy of 67% using linear discriminant analysis and 69 to 83% using ML algorithms, while quadratic discriminant analysis underperforms in comparison, yielding a less accurate prediction of 59%. The importance of each parameter in the prediction was assessed using an approach combining recursive feature elimination (RFE) techniques and random forest feature importance (RFFI). Major ions show high spatial range and play the main role in discrimination, while trace elements and bacteriological parameters of high local and/or temporal variability only play a minor role. The disparity of the results according to the characteristics of the GWB groups (geography, altitude, lithology, etc.) is discussed. Validating the grouping of GWBs will enable monitoring and surveillance strategies to be redirected on the basis of fewer, homogeneous hydrogeological units, in order to optimize sustainable management of the resource by the health agencies. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023 2023-01-01T00:00:00Z 2024-01-18T10:55:28Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.1/20311 |
url |
http://hdl.handle.net/10400.1/20311 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.3390/hydrology10120230 2306-5338 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
MDPI |
publisher.none.fl_str_mv |
MDPI |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799137056129024000 |