Classification Modelling: A Case Study of Breast Cancer Patients of Islamabad
Abstract
Purpose: The rate of breast cancer in Pakistan is the highest among all other Asian countries and all other types of cancer. The foremost treatment for breast cancer patients of stage 2 and stage 3 is surgery. The main types of surgery in this era are Mastectomy and Breast Conservative surgery. The decision about the type of surgery depends on the demographic and clinical factors.
Approach: In this study, the seven characteristics have been considered. A purposive sample of 365 breast cancer patients were collected from the two main hospitals in Islamabad. The foremost objective of this study was to classify each breast cancer patient regarding surgery type based on significant explanatory characteristics. The binary logistics regression and discriminant analysis techniques were used and the significance of each parameter was tested.
Findings: The main effects i.e., age, tumor size, Estrogen Receptor, and Progesterone Receptor were found to be significant with some diverse probabilities and all two-factor interactions were found to be non-significant. The sensitivity of logistic regression and discriminant analysis is almost the same i.e., 93.1% and 92.8% respectively whereas the specificity of these two techniques is also almost the same i.e., 70.8% and 71.9% respectively. The overall actual correct classify rate and Apparent error rate of both these techniques are found to be 87.7% and 12.3% respectively.
Implications: In brief, it was deducted that the Tumor size stage is the most imperative characteristic among other significant characteristics in discriminating between two types of surgery
Downloads
Metrics
References
Agresti, A. (2007). An introduction to Categorical data analysis (2nd ed.). A John Wiley & Sons, Inc. DOI: https://doi.org/10.1002/0470114754
Al-Gaithy, Z. K., Yaghmoor, B. E., Koumu, M. I., Alshehri, K. A., Saqah, A. A., & Alshehri, H. Z. (2019). Trend of mastectomy and breast conserving surgery and related factors in female breast cancer patients treated at king Abdul-Aziz University Hospital, Jeddah, Saudi Arabia, 2009-17. A retrospective cohort study. Annals of Medicine and Surgery, 41(10), 47-52. DOI: https://doi.org/10.1016/j.amsu.2019.03.012
American Cancer Society. (2021). Breast cancer Facts and Figure 2020-2021. Atlanta.
Barznji, N. S. (2018). Using logistic regression analysis and linear discriminant analysis to identify the risk factor of diabetes. Zanco Journal of Humanity Sciences, 22(6), 248-268. DOI: https://doi.org/10.21271/zjhs.22.6.17
Balogun, O. S., Akingbade, T. J., & Oguntunde, P. E. (2015). An assessment of the performance of discriminant analysis and the logistic regression methods in classification of Mode of delivery of expectant mother. Journal of Mathematical theory and Modeling, 5(5), 147-154.
Elgohari, H. (2017). Efficiency of discriminant analysis and multivariate logistic regression for the detection of anemic children with chronic kidney disease. International Journal of Statistics and Applications, 7(2), 131-136.
Hair, J. F., Jr., Black, W. C., Babin, B. J., & Anderson, R. E. (2010). Multivariate data analysis (7th ed.). Pearson Prentice Hall.
Hahs-Vaughn, D. L. (2017). Applied multivariate statistical concepts. Routledge. DOI: https://doi.org/10.4324/9781315816685
Hosmer, D. W., Lemeshow, S., & Sturdivant, R. X. (2013). Applied logistic regression (3rd ed.). Johnson Wiley & Sons, Inc. DOI: https://doi.org/10.1002/9781118548387
Huberty, C. j., & Oleinik, S. (2006). Applied MANOVA and Discriminant Analysis (2nd ed.). A John Wiley & Sons, Inc. DOI: https://doi.org/10.1002/047178947X
Johnson, R. A., & Wichern, D. W. (2007). Applied multivariate statistical analysis (6th ed.). Pearson Prentice Hall. DOI: https://doi.org/10.1002/0471667196.ess6094
Liong, C. Y., & Foo, S. F. (2013). Comparison of linear discriminant analysis and logistic regression for data set classification. AIP Conference Proceedings, 1522(1), 1159-1165. DOI: https://doi.org/10.1063/1.4801262
Lo, A. W. (1986). Logit versus discriminant analysis. A specification test and application to corporate bankruptcies. Journal of Econometrics, 31(2), 151–178. DOI: https://doi.org/10.1016/0304-4076(86)90046-1
Mohamed, S., & Shaheen, A. (2019). The robustness of binary logistic regression and linear discriminant analysis for the classification and differentiation of BTV cases in goats, 122–126.
Moiel, D., Thompson, J., & Larsen, K. D. (2019). Mastectomy or Breast-Conserving Therapy: Which Factors Influence A Patient's Decision? The Permanente journal, 23(49), 1185-1195.
Montgomery, M. E., White, M. E., & Martin, S. W. (1987). A comparison of discriminant analysis and logistic regression for the prediction of coliform mastitis in dairy cows. Canadian Journal of Veterinary Research, 51(4), 495-498.
Musa, A. B., Abedairaheem, A. A. A., Hamad, H., & Shaheen, S. M. A. (2019). The robustness of binary logistic regression and linear discriminant analysis for the classification and differentiation of BTV cases in goats. International journal of Multidisciplinary Research and Development, 6(11), 122-126.
National Cancer Institute. (2015). Understanding cancer. Retrieved from www.cancer.gov/ publication.
Penn, C. C. (2020). Comparison of discriminant analysis and logistic regression for classification of borrowers of microfinance institutes. International journal of commerce and management Research, 6(6), 20-25.
Rencher, A. C. (2002). Methods of Multivariate Analysis (2nd ed.). A John Wiley & Sons, Inc. DOI: https://doi.org/10.1002/0471271357
Shayan, Z., Meazerji, N. M. G., Shayan, L. & Naseri, P. (2015). Prediction of depression in cancer patients with different classification criteria, Linear Discriminant Analysis versus Logistic Regression. Global journal of health Science, 8(7), 41-46. DOI: https://doi.org/10.5539/gjhs.v8n7p41
Upton, G. J. (2017). Categorical Data Analysis by Example. A John Wiley & Sons, Inc.
World Health Organization. (2021). Breast Cancer. Geneva: WHO.
World Health Organization. (2021). Cancer. Geneva: WHO.
World Health Organization. (2021). Asia [Fact sheet]. Geneva: WHO.
Zangmo, C. & Tiensuwan, M. (2018). Application of logistic regression models to cancer patients: A case study of data from Jigme Dorji Wangchuck National Referral Hospital in Bhuttan. Journal of Physics: Conference series, 1039(1), 110-115. DOI: https://doi.org/10.1088/1742-6596/1039/1/012031
Copyright (c) 2022 Aansa Abbas, Muhammad Zakria, Muhammad Kashif

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
CSRC Publishing and SBSEE adhere to Creative Commons Attribution-Non Commercial 4.0 International License. The authors submitting and publishing in SBSEE agree to the copyright policy under creative common license 4.0 (Attribution-Non Commercial 4.0 International). Under this license, the authors published in SBSEE retain the copyright including publishing rights of their scholarly work and agree to let others remix, tweak, and build upon their work non-commercially. All other authors using the content of SBSEE are required to cite author(s) and publisher in their work. CSRC Publishing and JAFEE follow an Open Access Policy for copyright and licensing.