Bayesian Belief Networks and Score-Based Network Construction for Nitrate and Nitrite Level Prediction in Water Quality Testing of the Vaal River in South Africa

dc.contributor.authorStylianou, Nicholas
dc.contributor.supervisorEvans, Mary
dc.date.accessioned2025-10-01T12:53:39Z
dc.date.issued2024-09
dc.descriptionA dissertation submitted in fulfilment of the requirements for the Degree in Master of Science in Geography, to the Faculty of Science, School of Geography, Archaeology and Environmental Studies, University of the Witwatersrand, Johannesburg, 2024
dc.description.abstractThis research addresses the essential task of monitoring water quality, particularly in accurately predicting nitrate and nitrite levels in river water. These parameters are crucial indicators of water's suitability for human consumption and ecological health. Traditional monitoring methods, such as manual sampling and expensive sensors, are often prohibitive due to their high costs and labour-intensive nature. In response to these challenges, this research leverages machine learning (ML) algorithms to more efficiently predict nitrate and nitrite levels, utilising a dataset from one water monitoring station on the Vaal River comprising 1071 data points gathered over 21 years (1995-2016). This dataset includes a variety of water quality variables such as calcium (Ca), chloride (Cl), dissolved major salts (DMS), electrical conductivity (EC), fluoride (F), potassium (K), magnesium (Mg), ammonium (NH4), sodium (Na), phosphate (PO4), sulphate (SO4), silicon (Si), total alkalinity (TAL), and temperature. This research employs a series of Score-Based Constructed Bayesian Belief Network models, including Tree-Based All-Features (AF), Hill Climb (HC) with various criteria like Bayesian Information Criterion (BIC) and Random Features (RF), and a Random Forest model with All-Features with varying number of trees, including optimisations. These models are evaluated for their ability to predict nitrate and nitrite values, aiming to provide a cost-effective and efficient alternative to traditional monitoring techniques. The research assesses each model's prediction accuracy, striving for reliable predictions of nitrate and nitrite levels with the Akaike Information Criterion and Enhanced Bayesian Information Criterion Score-Based models performing with the best accuracy and capable of predicting nitrate and nitrite with Root Mean Square Error (RMSE) on the test data of 0.356 and 0.360 respectively, Mean Square Error (MSE) on the test data of 0.127 and 0.130 respectively, Mean Absolute Error (MAE) on the test data of 0.197 and 0.197 respectively and R-squared of 0.7940 and 0.7944 respectively. The study's findings have significant implications for monitoring and managing water resources, potentially shifting the paradigm for ensuring water quality sustainably.
dc.description.submitterMMM2025
dc.facultyFaculty of Science
dc.identifier0009-0001-6628-7011
dc.identifier.citationStylianou, Nicholas. (2024). Bayesian Belief Networks and Score-Based Network Construction for Nitrate and Nitrite Level Prediction in Water Quality Testing of the Vaal River in South Africa. [Master's dissertation, University of the Witwatersrand, Johannesburg]. WIReDSpace. https://hdl.handle.net/10539/46725
dc.identifier.urihttps://hdl.handle.net/10539/46725
dc.language.isoen
dc.publisherUniversity of the Witwatersrand, Johannesburg
dc.rights©2024 University of the Witwatersrand, Johannesburg. All rights reserved. The copyright in this work vests in the University of the Witwatersrand, Johannesburg. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of University of the Witwatersrand, Johannesburg.
dc.rights.holderUniversity of the Witwatersrand, Johannesburg
dc.schoolSchool of Geography, Archaeology and Environmental Studies
dc.subjectBayesian belief networks
dc.subjectBBN
dc.subjectWater quality
dc.subjectScore-based networks nitrate
dc.subjectNitrite
dc.subjectUCTD
dc.subject.primarysdgSDG-9: Industry, innovation and infrastructure
dc.subject.secondarysdgSDG-6: Clean water and sanitation
dc.titleBayesian Belief Networks and Score-Based Network Construction for Nitrate and Nitrite Level Prediction in Water Quality Testing of the Vaal River in South Africa
dc.typeDissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Stylianou_Bayesian_2024.pdf
Size:
3.45 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.43 KB
Format:
Item-specific license agreed upon to submission
Description: