Cross-Validation and Validation Set Methods for Choosing K in KNN Algorithm for Healthcare Case Study

Authors

  • Robbi Rahim Sekolah Tinggi Ilmu Manajemen Sukma, Jl. Sakti Lubis, Kota Medan, Sumatera Utara, 20219, Indonesia https://orcid.org/0000-0001-6119-867X
  • Ansari Saleh Ahmar Department of Statistics, Universitas Negeri Makassar, Makassar, Indonesia http://orcid.org/0000-0001-6888-9043
  • Rahmat Hidayat Department of Information Technology, Politeknik Negeri Padang, Limau Manis, Padang, 25164, Indonesia

DOI:

https://doi.org/10.35877/454RI.jinav1557

Keywords:

KNN Algorithm, Euclidean Distance, Healthcare

Abstract

KNN categorization is simple and successful in healthcare. In this research's example case study, the KNN algorithm classified the new record as "Abnormal." The classification method began with choosing K, then calculating the Euclidean distance between the new record and the training set, finding the K nearest neighbors, then classifying the new record based on those K neighbors. The findings show that the KNN algorithm is effective in healthcare and highlight several shortcomings that should be addressed in future study. Weighting variables, choosing the best K value, and handling non-uniform data are these restrictions. The findings show the KNN algorithm's medical potential.

References

Adeniyi, D. A., Wei, Z., & Yongquan, Y. (2016). Automated web usage data mining and recommendation system using K-Nearest Neighbor (KNN) classification method. Applied Computing and Informatics, 12(1), 90–108. https://doi.org/10.1016/j.aci.2014.10.001

Barbancho, J., León, C., Molina, F. J., & Barbancho, A. (2007). Using artificial intelligence in routing schemes for wireless networks. Computer Communications, 30(14–15), 2802–2811. https://doi.org/10.1016/J.COMCOM.2007.05.023

Bilal, M., Israr, H., Shahid, M., & Khan, A. (2016). Sentiment classification of Roman-Urdu opinions using Naïve Bayesian, Decision Tree and KNN classification techniques. Journal of King Saud University - Computer and Information Sciences, 28(3), 330–344. https://doi.org/10.1016/J.JKSUCI.2015.11.003

Di Lorenzo, R., Ingarao, G., & Micari, F. (2006). On the use of artificial intelligence tools for fracture forecast in cold forming operations. Journal of Materials Processing Technology, 177(1–3), 315–318. https://doi.org/10.1016/J.JMATPROTEC.2006.04.032

Ding, J., Cheng, H. D., Xian, M., Zhang, Y., & Xu, F. (2015). Local-weighted Citation-kNN algorithm for breast ultrasound image classification. Optik, 126(24), 5188–5193. https://doi.org/10.1016/J.IJLEO.2015.09.231

Djurabekova, F. G., Domingos, R., Cerchiara, G., Castin, N., Vincent, E., & Malerba, L. (2007). Artificial intelligence applied to atomistic kinetic Monte Carlo simulations in Fe–Cu alloys. Nuclear Instruments and Methods in Physics Research Section B: Beam Interactions with Materials and Atoms, 255(1), 8–12. https://doi.org/10.1016/J.NIMB.2006.11.039

Haryanto, A. A., Taniar, D., & Adhinugraha, K. M. (2015). Group Reverse kNN Query optimisation. Journal of Computational Science, 11, 205–221. https://doi.org/10.1016/J.JOCS.2015.09.006

Huang, A., Xu, R., Chen, Y., & Guo, M. (2023). Research on multi-label user classification of social media based on ML-KNN algorithm. Technological Forecasting and Social Change, 188, 122271. https://doi.org/10.1016/J.TECHFORE.2022.122271

Malyada Vommi, A., & Krishna Battula, T. (2023). A hybrid filter-wrapper feature selection using Fuzzy KNN based on Bonferroni mean for medical datasets classification: A COVID-19 case study. Expert Systems with Applications, 119612. https://doi.org/10.1016/J.ESWA.2023.119612

RADACEANU, E. (2007). ARTIFICIAL INTELLIGENCE & ROBOTS FOR PERFORMANCE MANAGEMENT – SOME METHODIC ASPECTS. IFAC Proceedings Volumes, 40(18), 319–324. https://doi.org/10.3182/20070927-4-RO-3905.00053

Shao, Z., Taniar, D., & Adhinugraha, K. M. (2015). Range-kNN queries with privacy protection in a mobile environment. Pervasive and Mobile Computing, 24, 30–49. https://doi.org/10.1016/J.PMCJ.2015.05.004

Shokrzade, A., Ramezani, M., Akhlaghian Tab, F., & Abdulla Mohammad, M. (2021). A novel extreme learning machine based kNN classification method for dealing with big data. Expert Systems with Applications, 183, 115293. https://doi.org/10.1016/J.ESWA.2021.115293

Valero-Mas, J. J., Calvo-Zaragoza, J., & Rico-Juan, J. R. (2016). On the suitability of Prototype Selection methods for kNN classification with distributed data. Neurocomputing, 203, 150–160. https://doi.org/10.1016/J.NEUCOM.2016.04.018

Wang, Y., & Chaib-draa, B. (2016). KNN-based Kalman filter: An efficient and non-stationary method for Gaussian process regression. Knowledge-Based Systems, 114, 148–155. https://doi.org/10.1016/J.KNOSYS.2016.10.002

Yesilbudak, M., Sagiroglu, S., & Colak, I. (2017). A novel implementation of kNN classifier based on multi-tupled meteorological input data for wind power prediction. Energy Conversion and Management, 135, 434–444. https://doi.org/10.1016/J.ENCONMAN.2016.12.094

Downloads

Published

2022-07-31

How to Cite

Rahim, R., Ahmar, A. S., & Hidayat, R. (2022). Cross-Validation and Validation Set Methods for Choosing K in KNN Algorithm for Healthcare Case Study. JINAV: Journal of Information and Visualization, 3(1), 57–61. https://doi.org/10.35877/454RI.jinav1557

Issue

Section

Articles