Repository logo
Communities & Collections
All of WIReDSpace
  • English
  • العربية
  • বাংলা
  • Català
  • Čeština
  • Deutsch
  • Ελληνικά
  • Español
  • Suomi
  • Français
  • Gàidhlig
  • हिंदी
  • Magyar
  • Italiano
  • Қазақ
  • Latviešu
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Srpski (lat)
  • Српски
  • Svenska
  • Türkçe
  • Yкраї́нська
  • Tiếng Việt
Log In
New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Research Data
  3. Browse by Author

Browsing by Author "Achilonu, Okechinyere Juliet"

Filter results by typing the first few letters
Now showing 1 - 2 of 2
  • Results Per Page
  • Sort Options
  • Thumbnail Image
    Item
    Knowledge discovery through mining mixed data types for cancer diagnostic and prognostic prediction modelling
    (2022) Achilonu, Okechinyere Juliet
    Background: Cancer is an increasingly multidimensional global health problem that demands continuous and concerted efforts toward diagnosis, treatment and management. Developing countries, including South Africa, are expected to experience an increasing burden of cancer morbidity and mortality because of a late-stage diagnosis, poor awareness, lack of biomedical screening and limited resources for treatment. Nonetheless, there is a growing trend to improve treatment outcomes by understanding the epidemiology and dynamics of cancer. The multifaceted clinical approach toward cancer has resulted in an unprecedented production of a large amount of complex and heterogeneous clinical data. These include information describing disease symptoms, diagnostics reports, drug usage and clinical outcomes, often presented in a structured, semi-structured and unstructured data format. Physicians’ intuition may not efficiently integrate these mixed data types in this situation for a realistic prognosis and treatment. Therefore, the next generation of cancer treatments may be found not by empirical science but by data scientists developing mathematical and statistical models to describe the nature, epidemiology and dynamics of cancer at various stages of the disease. Hence, cancer research is attracting the application of high-performance computing and big data analytics to enhance diagnosis, prognosis, and treatment. It has been shown that the use of data mining (DM), text mining (TM) and machine learning (ML) has efficiently and effectively uncovered trends from cancer data to support oncologists in decision-making. However, Africa is yet to embrace clinical decision-making in cancer diagnosis and prognosis using these algorithms. Current ML models for cancer detection and prognostic classifications have been chiefly designed for developed countries. Nevertheless, findings in the developed countries may not truly reflect the situation in an African population with higher diversity. Aim: This research aimed to contribute to knowledge discovery using descriptive and predictive analytics, specifically in colorectal, breast and prostate cancers. One of the objectives of this study is to develop and evaluate methods for extracting relevant information to classify cancer pathology reports. This will be achieved by analysing a large pool of historical pathology reports and assessing performance across cancer types. Secondly, there is only limited study on biological and clinical parameters which drives cancer prognosis, especially in Africa. This study aimed to develop a rule-based method to automatically extract important cancer prognostic parameters from free-text pathology reports, transform them into structured data, and uncover the trend of these parameters over the years. In addition, studies have shown that hospital length of stay (LOS) following surgical cancer resection varies across countries. However, cancer studies on hospital length of stay are unknown in an African population. This study aimed to present an avenue to understand the dynamics and prognostic classification of patient risk groups post-surgical recession. Finally, cancer recurrence and patient survival studies are limited in an African population. We explored the feasibility of integrating statistical and machine learning algorithms for the first time to achieve higher predictive performance and interpretability models for cancer recurrence and patient survival. Method: We conducted secondary data analysis using two data sources based on the study aims. The first data set consists of 181,000 breast, colorectal and prostate cancer patients whose diagnostic tests were carried out at the National Health Laboratory Services (NHLS) from 2008 to 2019. Each patient is described with demographic information, pathology reports, SNOMED morphology and topography codes. The data set addressed this study’s first and second objectives, focusing on information extraction from pathology reports, text classification and prediction, and trend analysis. The second data set consisted of 761 colorectal cancer patients diagnosed in Johannesburg. The patients were treated in private and public hospitals from 2015 to 2019 and followed until 2020. Each patient is described with more features detailing their demographic, clinical and histological information. We used this data set to address the third and fourth objectives, which focused on prognostic classification and prediction, with machine learning and statistical algorithms Results: The incidence of breast, colorectal and prostate cancer diagnostic tests trends is increasing over the year. In recent years, pathology reporting on these cancers has increased in text length, with more malignancy than benign. The result also showed inconsistencies and incompleteness in reporting each year and across the year of study. However, our methodologies could standardise and accurately extract essential parameters with high performance. The second objective showed that the developed rule-based method achieved high accurate annotation for all the parameters extracted, with performance measures ranging from 83% -100%. The trend analysis result showed significant trends in the proportion of molecular subtypes and Ki67, comparable to previous studies. Further, we observed that the median hospital LOS post-surgical resection was higher than those reported in the developed world. There is no significant difference in hospital LOS when comparing private and public facilities. Factors predisposing patients to prolonged hospital LOS include preoperative, perioperative and postoperative parameters. However, surgical complications are still the primary driver of prolonged hospital LOS. Finally, younger patients experienced higher recurrence than older patients but had a comparable survival rate. There was an improved survival for patients treated in a private hospital compared to those treated in public hospitals. Conclusion: We developed reproducible frameworks that can form the basis for future studies in South Africa, using DM, TM and ML algorithms. This study supports a nationally agreed standard in pathology reporting and the use of these algorithms for encoding, classifying, and producing high-quality information abstractions for cancer diagnosis, prognosis, incidence reporting, and research. The association established in this study may enable clinicians to understand the diagnostic and prognostic factors that influence patients’ health status and implement changes in patient care pathways. Clinicians can employ this type of study as a simple and quick test to flag patients at high risk. Such a strategy would improve clinical outcomes, but it is also likely to improve efficiency, favourably impacting the cost of care for cancer patients. The findings of this study can be generalised not only to the population of South African cancer patients but in other Sub-Saharan African countries with similar trends in urbanisation and dynamics in cancer epidemiology
  • Thumbnail Image
    Item
    Modelling graft survival after kidney transplantation using semi-parametric and parametric survival models
    (2017) Achilonu, Okechinyere Juliet
    This study presents survival modelling and evaluation of risk factors of graft survival in the context of kidney transplant data generated in South Africa. Beyond the Kaplan-Meier estimator, the Cox proportional hazard (PH) model is the standard method used in identifying risk factors of graft survival after kidney transplant. The Cox PH model depends on the proportional hazard assumption, which is rarely met. Assessing and accounting for this assumption is necessary before using this model. When the PH assumption is not valid, modi cation of the Cox PH model could o er more insight into parameter estimates and the e ect of time-varying predictors at di erent time points. This study aims to identify the survival model that will e ectively describe the study data by employing the Cox PH and parametric accelerated failure time (AFT) models. To identify the risk factors that mediate graft survival after kidney transplant, secondary data involving 751 adults that received a single kidney transplant in Charlotte Maxeke Johannesburg Academic Hospital between 1984 and 2004 was analysed. The graft survival of these patients was analysed in three phases (overall, short-term and long-term) based on the follow-up times. The Cox PH and AFT models were employed to determine the signi cant risk factors. The purposeful method of variable selection based on the Cox PH model was used for model building. The performance of each model was assessed using the Cox-Snell residuals and the Akaike Information Criterion. The t of the appropriate model was evaluated using deviance residuals and the delta-beta statistics. In order to further assess how appropriately the best model t the study data for each time period, we simulated a right-censored survival data based on the model parameter-estimates. Overall, the PH assumption was violated in this study. By extending the standard Cox PH model, the resulting models out-performed the standard Cox PH model. The evaluation methods suggest that the Weibull model is the most appropriate in describing the overall graft survival, while the log-normal model is more reasonable in describing short-and long-term graft survival. Generally, the AFT models out-performed the standard Cox regression model in all the analyses. The simulation study resulted in parameter estimates comparable with the estimates from the real data. Factors that signi cantly in uenced graft survival are recipient age, donor type, diabetes, delayed graft function, ethnicity, no surgical complications, and interaction between recipient age and diabetes. Statistical inferences made from the appropriate survival model could impact on clinical practices with regards to kidney transplant in South Africa. Finally, limitations of the study are discussed in the context of further studies.

DSpace software copyright © 2002-2025 LYRASIS

  • Privacy policy
  • End User Agreement
  • Send Feedback
Repository logo COAR Notify