Model performance optimisation in credit card fraud detection using class imbalance techniques, feature engineering and feature selection techniques

Loading...
Thumbnail Image

Journal Title

Journal ISSN

Volume Title

Publisher

University of the Witwatersrand, Johannesburg

Abstract

Fraud detection in financial datasets, particularly in credit card transactions, presents a significant challenge due to the prevalence of irrelevant features and class imbalances. Addressing these issues is crucial for optimizing model performance and accurately identifying fraudulent activities. This research focuses on the application of feature engineering, class imbalance handling techniques alongside a comparative analysis of feature selection techniques such as Chi-Square, ANOVA, (Recursive Feature Elimination) RFE, and (Information Gain) IG all in bid to find the best combination of techniques that enhance model accuracy in (Credit Card Fraud Detection) CCFD. To mitigate class imbalances, Synthetic Minority Oversampling Technique) SMOTE , (Synthetic Minority Oversampling Technique With Edited Nearest-Neighbours) SMOTE-EEN, and simple oversampling were employed. These methods aimed to balance the class distribution, improving the models’ ability to detect fraud. Popular classification models, including Decision Trees, KNN, AdaBoost, and XGBoost, were trained on datasets that had undergone feature engineering, class imbalance techniques and feature selection all in bid to produce optimized model performances. The study utilized evaluation metrics like F1-score, Balanced Accuracy and ROC-AUC to assess model performance and the results demonstrated how feature engineering, combined with specifically SMOTE-EEN as the class imbalance handling technique alongside strategic ANOVA or Chi-Square Test, significantly improved the accuracy and robustness of the fraud detection models with accuracy scores of over 90% across the four classifiers on the four datasets. These findings will thus help provide valuable insights for industry researchers in selecting the most effective techniques for optimizing model performance in fraud detection studies.

Description

A research report submitted in partial fulfilment of the requirements for the degree of Master of Science - Research Report (e-Science), to the Faculty of Science, School of Computer Science & Applied Mathematics, University of the Witwatersrand, Johannesburg, 2025

Citation

Assabil, Joseph Junior. (2025). Model performance optimisation in credit card fraud detection using class imbalance techniques, feature engineering and feature selection techniques. [Master's dissertation, University of the Witwatersrand, Johannesburg]. WIReDSpace. https://hdl.handle.net/10539/48062

Endorsement

Review

Supplemented By

Referenced By