American Journal of Water Resources
ISSN (Print): 2333-4797 ISSN (Online): 2333-4819 Website: https://www.sciepub.com/journal/ajwr
American Journal of Water Resources. 2025, 13(3), 86-96
DOI: 10.12691/ajwr-13-3-3

Predictive Modelling of Groundwater Quality in the Nakanbé River Basin Using Machine Learning Techniques

Issoufou OUEDRAOGO1,2, W. J. P. SANDWIDI1,2, Fatoumata KABORE3, Mahamadou KONARE4 and Cheick Abdramane OUATTARA3

1Mining Engineering Department, University Yembila-Abdoulaye-TOGUYENI, BP 54 Fada N'Gourma, Burkina Faso

2Geosciences and Environment Laboratory (LaGE), Joseph KI-ZERBO University, BP 7021, Ouagadougou, Burkina Faso

3Direction de la Qualité des Eaux, Ministère de l'Environnement, de l'Eau et de l'Assainissement, Burkina Faso

4Laboratoire Sciences et Technologie (LaST), Université Thomas SANKARA, 12 BP 417 Ouagadougou 12, Burkina Faso

Pub. Date: August 28, 2025

Cite this paper:
Issoufou OUEDRAOGO, W. J. P. SANDWIDI, Fatoumata KABORE, Mahamadou KONARE and Cheick Abdramane OUATTARA. Predictive Modelling of Groundwater Quality in the Nakanbé River Basin Using Machine Learning Techniques. American Journal of Water Resources. 2025; 13(3):86-96. doi: 10.12691/ajwr-13-3-3

Abstract

Continuous monitoring of groundwater quality is essential for protecting public health and the environment, particularly in vulnerable regions such as Burkina Faso’s Nakanbé Basin, where groundwater is a primary source of potable water. This study developed and evaluated machine learning (ML) models to predict two key water quality parameters, Total Dissolved Solids (TDS) and Total Alkalinity (TA), using data provided by the General Directorate of Water Resources (DGRE). A total of 1,765 groundwater samples covering nineteen physicochemical parameters were analyzed. Prior to modelling, a multicollinearity analysis was conducted to ensure the reliability of the input variables. Three regression algorithms were compared for their predictive performance: Random Forest Regression (RFR), Multiple Linear Regression (MLR), and Decision Tree Regression (DTR). Among them, RFR was the most accurate, with the highest R² and the lowest error metrics (MAE, RMSE) on both the training and testing datasets for both TDS and TA. MLR offered consistent and interpretable results, particularly for TA, whereas DTR overfitted strongly and generalized poorly to the test data. The results highlight the superiority of ensemble learning approaches, particularly RFR, in capturing complex, nonlinear relationships within groundwater quality datasets. Applying ML in this context provides a cost-effective and scalable alternative to conventional laboratory-based monitoring methods. It also enables the identification of influential water quality parameters, supports contamination risk assessment, and contributes to evidence-based water resource management strategies. These findings demonstrate the potential of ML tools to enhance groundwater monitoring and advance sustainable water governance in arid and semi-arid regions.
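The workflow summarized in the abstract (a multicollinearity check followed by a comparison of MLR, DTR, and RFR using R², MAE, and RMSE on training and testing data) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' code: the data are synthetic, the four predictors and the target formula are hypothetical, the variance inflation factor (VIF) is one common multicollinearity diagnostic and may not be the one used in the study, and the 80/20 split and model hyperparameters are arbitrary choices. It relies on NumPy and scikit-learn.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

# ASSUMPTION: synthetic stand-in data, not the DGRE dataset used in the study.
rng = np.random.default_rng(42)
n_samples = 500
X = rng.uniform(0.0, 100.0, size=(n_samples, 4))  # four hypothetical predictors
# Hypothetical target: a linear part, a nonlinear interaction, and noise.
y = (3.0 * X[:, 0] + 1.5 * X[:, 1] + 0.02 * X[:, 2] * X[:, 3]
     + rng.normal(0.0, 10.0, n_samples))


def vif(features):
    """Variance inflation factor per column: 1 / (1 - R^2), where R^2 comes
    from regressing that column on all the other columns."""
    scores = []
    for j in range(features.shape[1]):
        others = np.delete(features, j, axis=1)
        target = features[:, j]
        r2 = LinearRegression().fit(others, target).score(others, target)
        scores.append(1.0 / (1.0 - r2))
    return np.array(scores)


def evaluate(model, X_part, y_part):
    """Return R^2, MAE, and RMSE of a fitted model on one data partition."""
    pred = model.predict(X_part)
    return {
        "r2": r2_score(y_part, pred),
        "mae": mean_absolute_error(y_part, pred),
        "rmse": float(np.sqrt(mean_squared_error(y_part, pred))),
    }


# Step 1: multicollinearity screening (values near 1 mean little collinearity).
vif_scores = vif(X)

# Step 2: hold out a test set (ASSUMPTION: 80/20 split).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Step 3: fit the three regressors and score each on both partitions.
models = {
    "MLR": LinearRegression(),
    "DTR": DecisionTreeRegressor(random_state=42),
    "RFR": RandomForestRegressor(n_estimators=200, random_state=42),
}
results = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    results[name] = {
        "train": evaluate(model, X_train, y_train),
        "test": evaluate(model, X_test, y_test),
    }

print("VIF:", np.round(vif_scores, 2))
for name, parts in results.items():
    for part, metrics in parts.items():
        print(name, part, {k: round(v, 3) for k, v in metrics.items()})
```

Comparing the `train` and `test` entries for each model is what exposes overfitting: an unpruned decision tree typically fits the training data almost perfectly while scoring noticeably worse on the held-out data.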

Keywords:
Machine Learning, Regression, Groundwater Quality Parameters, Nakanbé Basin, Burkina Faso

This work is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
