Clustering and Classification Techniques in the Presence of Outliers: An Application to the Johannesburg Stock Exchange Stocks
dc.contributor.author | Maphalla, Retsebile | |
dc.contributor.supervisor | Chipoyera, HW | |
dc.date.accessioned | 2024-10-28T13:00:16Z | |
dc.date.available | 2024-10-28T13:00:16Z | |
dc.date.issued | 2024 | |
dc.description | A dissertation submitted to the Faculty of Science, University of the Witwatersrand, in partial fulfillment of the requirements for the degree of Master of Science, Johannesburg 2024 | |
dc.description.abstract | In this study, the impact of outliers on clustering using the K-means algorithm was explored. It was observed that a high prevalence of outliers can seriously compromise the results of clustering. A novel algorithm called Clustering-quality-aided outlier detection (CQAOD) is proposed in this study. The novelty stems from the fact that apart from identifying outliers, good quality clustering is achieved and the “optimal” number of clusters for K-means clustering of multivariate Gaussian data is simultaneously proffered. In the case of the Johannesburg Stock Exchange (JSE) data, an investigation to compare the efficacy of the following clustering techniques: Hierarchical clustering, spectral clustering, Clustering Large Applications (Clara), Density-based spatial clustering of applications with noise (DBSCAN) was done with the aim of constructing a diversified stock portfolio. The study found that the hierarchical clustering algorithm is the best algorithm to cluster the shares on the JSE | |
dc.description.submitter | MM2024 | |
dc.faculty | Faculty of Science | |
dc.identifier | https://orcid.org/ 0000-0003-1351-1822 | |
dc.identifier.citation | Maphalla, Retsebile. (2024). Clustering and Classification Techniques in the Presence of Outliers: An Application to the Johannesburg Stock Exchange Stocks [Master’s dissertation, University of the Witwatersrand, Johannesburg]. WireDSpace. | |
dc.identifier.uri | https://hdl.handle.net/10539/42024 | |
dc.language.iso | en | |
dc.publisher | University of the Witwatersrand, Johannesburg | |
dc.rights | © 2024 University of the Witwatersrand, Johannesburg. All rights reserved. The copyright in this work vests in the University of the Witwatersrand, Johannesburg. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of University of the Witwatersrand, Johannesburg. | |
dc.school | School of Statistics and Actuarial Science | |
dc.subject | Clustering | |
dc.subject | Classification | |
dc.subject | K-means | |
dc.subject | Multivariate | |
dc.subject | CQAOD | |
dc.subject | UCTD | |
dc.subject.other | SDG-8: Decent work and economic growth | |
dc.title | Clustering and Classification Techniques in the Presence of Outliers: An Application to the Johannesburg Stock Exchange Stocks | |
dc.type | Thesis |