Diagnosis of Head and Neck Cancer in Developing Countries Using a Stacked Ensemble Model


  •   Folake Akinbohun

  •   Ambrose Akinbohun

  •   Adekunle Daniel

  •   Oghenerukevwe Elohor Ojajuni


Head and neck cancers (HNC) are indicated when cells grow abnormally.  The incidence of HNC is on the increase owing to several factors. There is often late presentation that can result in loss of lives (mortality) especially in Africa due to paucity of specialists. These challenges prompted the development of a stacked ensemble model for diagnosis of HNC to facilitate prompt referral.  The data were collected which consists of 1473 instances with 18 features.   Information Gain was used for selecting important features and three supervised learning algorithms were deployed for the base learners: Decision Tree (C4.5), K-Nearest Neighbors and Naïve Bayes. The predictions of the base learners were combined and passed to meta learners: Logistic Model Tree (LMT). The result showed that Information Gain method with stacked LMTwas 95.11%. It was deduced that both Information Gain with stacked MLR produced higher accuracy that the base learners’ results. Hence, this stacked model can be used for diagnosis of HNC in healthcare systems.

Keywords: Decision Tree, Naïve Bayes, Sinonasal, Larynx


Vipul, N., Punita, L., Mranalini, V. and Rajan, Yadav (2015). Evaluation of fatigue in Head and Neck Cancer Patients undergoing (intensity modulated radiation theraphy) radiotherapy: A prospective study. Asian Journal of Oncology. 1(1). DOI: 10.4103/2454-6798.165111

GBD (2015). Mortality and Causes of Death, Collaborators. Global, Regional, and National Life Expectancy, All-Cause Mortality, and Cause-Specific Mortality for 249 Causes of Death, 1980-2015: A Systematic Analysis for the Global Burden of Disease Study 2015. Lancet. 388 (10053): 1459–1544.

World Health Organization (2014). World Cancer Report 2014. Chapter 5.8.

National Cancer Institute. Available at https://www.cancer.gov/about-cancer/understanding/statistics

Zhi, C., Minoru, N., Chen, H., Scott, P. R., Xuan Hui, J. A. M., Michael, R. B., Ana, P. K., Brandi, R. P., Laura, B., Mariah, M., Amanda, Choflet, K. S., Shinya, S., Kazuki, U., John, W. W., Todd, R. M. and Harry, Q. (2018). Evaluation of Classification and Regression Tree (CART) Model in Weight Loss Prediction Following Head and Neck Cancer Radiation Therapy. Advances in Radiation Oncology.3(3): 346–355

Amanda, C., Kousuke, S., Shinyasugiyama, Kazuki, U., John, W. W., Todd, R. M. and Harry, Q. (2018). Evaluation of Classification and Regression Tree (CART) Model in Weight Loss Prediction Following Head and Neck Cancer Radiation Therapy. Elsevier Inc. on Behalf of the American Society for Radiation Oncology. 3(3): 346–355

Adoga, A. A., Silas, O. A., Nimkur, T. L. (2009). Open Cervical Lymph Node Biopsy for Head and Neck Cancers: Any Benefit? Head Neck Oncol.1:9.

Hongxun, W., Zhaohong, D., Bingjie, Z. and Qianyun L.(2016). Classifier Model Based on Machine Learning Algorithms: Application to Differential Diagnosis of Suspicious Thyroid Nodules Via Sonography. American Journal of Roentgenology. 207 (4): 859-864

Massa, S. T., Osasuwa-Peters, N, Christopher, K.M., Arnold, L. D., Schootman, M., Walker, R. J. and Varvares, M. A. (2016). Competing Causes of Death in the Head and Neck Cancer Population. National Center for Biotechnology Information, U.S. National Library of Medicine Elsevier Ltd.

Vidhu, R. and Kiruthika, S. (2016). A New Feature Selection Method for Oral Cancer using Data Mining Techniques. International Journal Of Advanced Research In Computer and Communication Engineering (IJARCCE) 5 (1)

Prerana, P. S. and Khushboo, T. (2015). Predictive Data Mining for Diagnosis of Thyroid Disease using Neural Network. International Journal of Research in Management, Science & Technology. 3(2)

Durairaj, M. and Deepika R. (2015). Prediction of Acute Myeloid Leukemia Cancer Using Data Mining- A Survey. International Journal Of Emerging Technology and Innovative Engineering I (2)

Abdelghani, B. and Erhan, G. (2005). Predicting Breast Cancer Survivability using Data Mining Techniques. Department of Computer Science, the George Washington University, Washington DC

Sato, F., Shimada, Y., Selaru, F. M., Shibata, D., Maeda, M., Watanaabe, G., Mori, Y. S. S., Imamura, M. and Meltzer, S. J. (2005). Prediction of Survival in Patients with Esophageal Carcinoma Using Artificial Neural Networks. American Cancer Society.

Osiris Villacampa (2015). Feature Selection and Classification Methods for Decision Making: A Comparative Analysis. Nova Southeastern University NSUworks. CEC Theses and Dissertations, College of Engineering and Computing

Jasmina, N., Perica, S. and Dusan, B. (2011). Toward Optimal Feature Selection Using Ranking Methods and Classification Algorithm. Yugoslav Journal of Operations Research. 21(1): 119-135

Jiawei H., Micheline, K. and Jian, P. (2011). Data Mining: Concepts and Techniques 3rd Edition

Altman, N. S. (1992). An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression. 46 (3): 175–185.

Wu, X., Kumar V., Quinlan, J. R., Ghosh, J., Yang, Q., Motoda, H., Mclachlan, G. J., Ng, A., Philip S. B., Zhi-Hua Zhou, Y., Steinbach, M., Hand D. J., and Steinberg, D. (2007). Top 10 Algorithms in Data Mining. Knowledge Information System. Springer-Verlag London Limited

Sagar, S. N. (2012). A Comparative Study of Classification Techniques in Data Mining Algorithms. International Conference on Computer Science and Electronics Engineering

Han, J., Kamber, M. and Pei, J. (2012). Data Mining: Concepts and Techniques, 3rd Edition. Elsevier, Amsterdam

Landwehr, N., Hall, M. and Frank, E. (2005). Logistic Model Trees. Machine Learning.59: 161.


Download data is not yet available.


How to Cite
Akinbohun, F., Akinbohun, A., Daniel, A. and Ojajuni, O. 2020. Diagnosis of Head and Neck Cancer in Developing Countries Using a Stacked Ensemble Model. European Journal of Engineering Research and Science. 5, 9 (Sep. 2020), 1097-1101. DOI:https://doi.org/10.24018/ejers.2020.5.9.2095.