Electronic Theses and Dissertations (Masters)
Permanent URI for this collectionhttps://hdl.handle.net/10539/38009
Browse
2 results
Search Results
Item Assessing and comparing the performance of different machine learning regression algorithms in predicting Chlorophyll-a concentration in the Vaal Dam, Gauteng(University of the Witwatersrand, Johannesburg, 2024-03) Mahamuza, Phemelo Hope; Adam, ElhadiThe state of Vaal Dam is influenced by various land uses surrounding the Dam, including agricultural activities, mining operations, industrial enterprises, urban settlements, and nature reserves. Mining activities, farming practices, and sewage outflows from nearby villages led to access contamination within the Dam, increasing algal bloom levels. Sentinel-2 MSI data were utilized to forecast and comprehend the spatial pattern of Chlorophyll-a concentration, indicating algal bloom occurrence in the Vaal Dam. Targeting Sentinel-2 Level-1C, the image was preprocessed on the Google Earth Engine (GEE) with acquisition dates from 25 – 26 October 30, 2016, corresponding to the on-site data collection between October 26 and October 28, 2016. Due to limited resources, up-to-date data on the Vaal Dam could not be collected. However, since this study focuses on applying various machine learning regression models to predict chlorophyll-a levels in waterbodies, the dataset is used to test the models rather than reflect the current state of the Vaal Dam. The dataset, comprising 23 samples, was divided into 70% training and 30% test sets, allowing for comprehensive model evaluation. Band ratio reflectance values were extracted from the satellite image and correlated with in-field Chlorophyll-a values. The highest correlation coefficient values were utilized to train five machine-learning models employed in this study: Random Forest (RF), Support Vector Regression (SVR), Least Absolute Shrinkage and Selection Operator (LASSO), Ridge Regression, and Multilinear Regression (MLR). Each model underwent training with ten iterations each; the best learning iteration was then used to generate the final Chlorophyll-a predictive model. The predictive models were validated using the Sentinel-2 MSI satellite data and in-situ measurements using R2, RMSE, and MAPE. Among the five machine learning algorithms trained, RF performed the best, with an R2 of 0.86 and 0.95, an RMSE of 1.38 and 0.8, and MAPE of 15.09% and 10.92% for the training and testing sets, respectively, indicating its ability to handle small, non-linear datasets. SVR also demonstrated a fair performance, particularly in handling multicollinearity in the data points with an R2 of 0.68 and 0.87, an RMSE of 2.37 and 1.56, and MAPE of 18.13% and 19.28% for the training and testing sets, respectively. The spatial pattern of Chlorophyll-a concentrations, mapped from the RF model, indicated that high concentrations of Chlorophyll-a are along the Dam shorelines, suggesting a significant impact of land use activities on pollution levels. This study emphasizes the importance of selecting suitable machine learning algorithms tailored to the dataset's characteristics. RF and SVR demonstrated proficiency in handling nonlinearity, with RF displaying enhanced generalization and resistance to overfitting. Limited field data evenly distributed across the Dam and satellite overpass dates may affect result accuracy. Future research should align satellite pass dates with fieldwork dates and ensure an even distribution of in-field samples across the Dam to represent all land uses and concentration levels.Item Mapping and monitoring land transformation of Boane district, Mozambique (1980 – 2020), using remote sensing(University of the Witwatersrand, Johannesburg, 2023) Dengo, Claudio Antonio; Atif, Iqra; Adam, ElhadiAlthough natural and environmental factors play a significant role in land transformation, human actions dominate. Therefore, to better understand the present land uses and predict the future, accurate information describing the nature and extent of changes over time is necessary and critical, especially for developing countries. It is estimated that these countries will account for 50% of the world's population growth in the next few years. Hence, this research was an attempt to assess and monitor land cover changes in Boane, Mozambique, over the past 40 years and predict what to expect in the next 30 years. This district has been challenged by a fast-growing population and land use dynamic, with quantitative information, driving forces and impacts remaining unknown. Through a supervised process in a cloud base Google Earth Engine platform, a set of five Landsat images at ten-year intervals were classified using a random forest algorithm. Seven land classes, i.e., agriculture, forest, built-up, barren, rock, wetland and water bodies, were extracted and compared through a pixel-by-pixel process as one of the most precise and accurate methods in remote sensing and geographic information system applications. The results indicate an active alternate between all land classes, with significant changes observed within agriculture, forest and build-up classes. As it is, while agriculture (-26.1%) and forest (-21.4%) showed a continuously decreasing pattern, build-up class (45.8%) increased tremendously. Consequently, over 69% of the forest area and 59% of the agricultural area shifted into build-up, i.e., was degraded or destroyed. Similarly, the conversion of barren land area (57.2%) and rock area (47.3%) into build-up indicates that those areas were cleaned. The overall classification accuracy averaged 90% and a kappa coefficient of 0.8779 were obtained. The CA-Markov model, used to assess future land uses, indicates that build-up will continue to increase significantly, covering 60% of the total area. From this finding, the land cover situation in the next 30 years will be critical if no action is taken to stop this uncontrolled urban sprawl. An adequate land use plan must be drawn, clearly indicating the locations for different activities and actions for implementation.