Content area

Abstract

Groundwater samples were collected from 45 wells across three regions in the United Arab Emirates (UAE)—Jabel Hafeet, Fujairah, and Ras Al Khaimah; their major-ion chemistry and radon activity were analyzed to characterize hydrogeochemical facies. Machine learning (ML) techniques were employed to impute the missing chloride and sulfate concentrations for any missing samples. In this regard, the best models accuracy wise were optimized Random Forest and Extra-trees. The resultant complete dataset was subjected to unsupervised K-means clustering. Mapping the clusters using GeoZ library revealed distinct spatial patterns related to different geological settings. Most Fujairah and Ras Al Khaimah samples clustered together, indicating aquifer similarity, while the Jabel Hafeet samples clustered separately. Several Jabel Hafeet surface water samples were clear outliers. Within the clusters, radon exhibited variation related to groundwater source and could be a useful environmental tracer. The study demonstrates that machine learning could be used to extract meaningful information from incomplete geoscience data. Major findings were the hydrogeochemical similarities between the Fujairah and Ras Al Khaimah aquifers and their differences with the Hafeet aquifer, identification of the Jabel Hafeet surface water samples, and utility of radon in environmental tracing. This research provides valuable insights into major UAE aquifers and the ability of artificial intelligence to boost the value of imperfect datasets.

Highlights

Machine learning can predicts missing hydrogeochemical data, enhancing groundwater insights.

Radon is a valuable tracer for tracking groundwater sources in arid regions.

Fujairah and Ras Al Khaimah aquifers exhibit similar water chemistry profiles.

Details

1009240
Business indexing term
Title
Leveraging machine learning to extract insights and spatial patterns from hydrogeochemical datasets for major groundwater regions in the UAE
Author
ElHaj, Khalid 1   VIAFID ORCID Logo  ; Alshamsi, Dalal 1   VIAFID ORCID Logo  ; Alblooshi, Balqees 2 ; Haile, Fatima 3 ; AlRashdi, Shamma 3 ; Elabyad, Basant 4 

 United Arab Emirates University, Department of Geosciences, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666); United Arab Emirates University, National Water and Energy Center, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666) 
 United Arab Emirates University, Department of Geosciences, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666) 
 United Arab Emirates University, Statictics and Business Analytics Department, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666) 
 United Arab Emirates University, Department of Chemistry, Al Ain, United Arab Emirates (GRID:grid.43519.3a) (ISNI:0000 0001 2193 6666) 
Publication title
Volume
7
Issue
7
Pages
764
Publication year
2025
Publication date
Jul 2025
Publisher
Springer Nature B.V.
Place of publication
London
Country of publication
Netherlands
Publication subject
ISSN
25233963
e-ISSN
25233971
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-07-11
Milestone dates
2025-06-20 (Registration); 2024-10-30 (Received); 2025-06-20 (Accepted)
Publication history
 
 
   First posting date
11 Jul 2025
ProQuest document ID
3229410772
Document URL
https://www.proquest.com/scholarly-journals/leveraging-machine-learning-extract-insights/docview/3229410772/se-2?accountid=208611
Copyright
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-07-12
Database
ProQuest One Academic