Classification Modelling: A Case Study of Breast Cancer Patients of Islamabad

  • Aansa Abbas, Government Associate College for women, Gawalmandi, Rawalpindi, Pakistan
  • Muhammad Zakria, Allama Iqbal Open University, Islamabad, Pakistan
  • Muhammad Kashif, University of Agriculture, Faisalabad, Pakistan, https://orcid.org/0000-0002-8682-5657
Keywords: Logistic Regression, Discriminant Analysis, Mastectomy, Breast-Conserving Surgery, Classification

Abstract

Purpose: Pakistan has the highest breast cancer rate among Asian countries, and breast cancer is more common there than any other type of cancer. Surgery is the primary treatment for stage 2 and stage 3 breast cancer patients. The two main types of surgery are mastectomy and breast-conserving surgery. The choice between them depends on demographic and clinical factors.

Approach: In this study, seven characteristics were considered. A purposive sample of 365 breast cancer patients was collected from two main hospitals in Islamabad. The main objective was to classify each breast cancer patient by surgery type based on the significant explanatory characteristics. Binary logistic regression and discriminant analysis were used, and the significance of each parameter was tested.
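
For orientation, the two classifiers named in the approach can be written in their standard textbook form. This is a sketch under stated assumptions, not the fitted models from the study: the abstract does not report the coefficients, the cut-off c, or the prior probabilities q_1 and q_2, and the symbols below (k explanatory characteristics, group means, pooled covariance matrix) are generic notation.

```latex
% Binary logistic regression: the log-odds of one surgery type is a linear
% function of the k explanatory characteristics x_1, ..., x_k.
\[
\log\frac{\pi(\mathbf{x})}{1-\pi(\mathbf{x})} = \beta_0 + \sum_{j=1}^{k}\beta_j x_j,
\qquad
\text{assign to group 1 if } \hat{\pi}(\mathbf{x}) \ge c .
\]

% Two-group linear discriminant rule, assuming equal covariance matrices
% and equal misclassification costs (both are assumptions here).
\[
\text{assign to group 1 if }\;
(\bar{\mathbf{x}}_1-\bar{\mathbf{x}}_2)^{\top}\mathbf{S}_{\text{pooled}}^{-1}\mathbf{x}
-\tfrac{1}{2}(\bar{\mathbf{x}}_1-\bar{\mathbf{x}}_2)^{\top}\mathbf{S}_{\text{pooled}}^{-1}(\bar{\mathbf{x}}_1+\bar{\mathbf{x}}_2)
\ge \ln\frac{q_2}{q_1}.
\]
```

Both rules are linear in the characteristics, which may be one reason the two techniques often classify similarly on the same data.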

Findings: The main effects of age, tumor size, estrogen receptor, and progesterone receptor were found to be significant, at differing probability levels, whereas all two-factor interactions were non-significant. The sensitivity of logistic regression and discriminant analysis was almost the same (93.1% and 92.8%, respectively), as was the specificity (70.8% and 71.9%, respectively). Both techniques gave an overall correct classification rate of 87.7% and an apparent error rate of 12.3%.
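
The four rates quoted in the findings all derive from a 2x2 classification table, and the arithmetic can be sketched in Python. The counts used here are an assumption: they were chosen only so that they sum to the 365 patients and are arithmetically consistent with the logistic regression percentages. The abstract does not give the actual table, nor does it say which surgery type is treated as the positive class. The function name `classification_rates` is likewise illustrative.

```python
def classification_rates(tp, fn, tn, fp):
    """Return the standard rates derived from a 2x2 classification table."""
    total = tp + fn + tn + fp
    accuracy = (tp + tn) / total
    return {
        # Share of actual positives that were classified as positive.
        "sensitivity": tp / (tp + fn),
        # Share of actual negatives that were classified as negative.
        "specificity": tn / (tn + fp),
        # Share of all cases that were classified correctly.
        "correct_classification_rate": accuracy,
        # Share of all cases misclassified on the data used to fit the rule.
        "apparent_error_rate": 1 - accuracy,
    }


# Hypothetical counts (NOT taken from the paper); they sum to 365.
rates = classification_rates(tp=257, fn=19, tn=63, fp=26)

for name, value in rates.items():
    print(f"{name}: {value:.1%}")
```

The apparent error rate is one minus the correct classification rate, which is why the reported 87.7% and 12.3% sum to 100%.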

Implications: In brief, it was deduced that tumor size stage is the most important of the significant characteristics in discriminating between the two types of surgery.

Author Biographies

Aansa Abbas, Government Associate College for women, Gawalmandi, Rawalpindi, Pakistan

Assistant Professor, Government Associate College for women, Gawalmandi, Rawalpindi, Pakistan

Muhammad Zakria, Allama Iqbal Open University, Islamabad, Pakistan

Associate Professor, Department of Statistics, Allama Iqbal Open University, Islamabad, Pakistan

Muhammad Kashif, University of Agriculture, Faisalabad, Pakistan

Lecturer, Department of Mathematics & Statistics, University of Agriculture, Faisalabad, Pakistan

Published
2022-06-30
How to Cite
Abbas, A., Zakria, M., & Kashif, M. (2022). Classification Modelling: A Case Study of Breast Cancer Patients of Islamabad. Sustainable Business and Society in Emerging Economies, 4(2), 239-248. https://doi.org/10.26710/sbsee.v4i2.2271