Naive Bayes and KNN for Airline Passenger Satisfaction Classification: Comparative Analysis

Annisa Nurdina; Audita Bella Intan Puspita

doi:10.52465/joiser.v1i2.167

Published: Jul 14, 2023

Keywords:

Airlines Passenger, K-Nearest Neighbour, Naive Bayes, Classification, Data Mining

Annisa Nurdina

Universitas Negeri Semarang, Indonesia

Audita Bella Intan Puspita

Universitas Negeri Semarang, Indonesia

Abstract

Air transportation has become vital as technology advances and globalization continues. It is affordable and accessible worldwide, and it provides efficient service to destinations around the globe. This study focuses on full-service airlines that offer online-based services. Previous research indicates that the available facilities and services influence passenger satisfaction, but it reported the correlation between satisfaction and services without precise figures. In the present study, customer satisfaction is classified using the Naive Bayes and K-Nearest Neighbour (K-NN) algorithms to obtain a tested level of accuracy, and the effectiveness of the two algorithms in classifying airline passenger satisfaction is compared. The results show that Naive Bayes is more accurate than K-NN: its accuracy is 84.48%, compared with 65.38% for K-NN. Naive Bayes also achieves higher precision (82.25% versus 67.35%) and higher recall (82.43% versus 74.33%).
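The comparison described in the abstract can be illustrated with a minimal sketch. The dataset, features, value of k, and preprocessing used in the study are not given on this page, so everything below is an assumption: the two "service rating" features and the toy samples are hypothetical, Gaussian Naive Bayes is one possible Naive Bayes variant, and k = 3 is an arbitrary choice. The sketch shows only how two such classifiers can be scored with accuracy, precision, and recall; it does not reproduce the reported figures.

```python
import math
from collections import Counter

def gaussian_nb_fit(X, y):
    """Estimate, per class, the prior and each feature's mean and variance."""
    model = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        cols = list(zip(*rows))
        means = [sum(col) / len(rows) for col in cols]
        # A small constant keeps the variance strictly positive.
        variances = [
            sum((v - m) ** 2 for v in col) / len(rows) + 1e-9
            for col, m in zip(cols, means)
        ]
        model[label] = (len(rows) / len(y), means, variances)
    return model

def gaussian_nb_predict(model, x):
    """Return the class with the highest log posterior for sample x."""
    def log_posterior(label):
        prior, means, variances = model[label]
        score = math.log(prior)
        for v, m, var in zip(x, means, variances):
            score += -0.5 * math.log(2 * math.pi * var) - (v - m) ** 2 / (2 * var)
        return score
    return max(model, key=log_posterior)

def knn_predict(X_train, y_train, x, k=3):
    """Majority vote among the k training samples closest to x (Euclidean)."""
    nearest = sorted(zip(X_train, y_train), key=lambda p: math.dist(p[0], x))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

def evaluate(y_true, y_pred, positive=1):
    """Return (accuracy, precision, recall) for a binary task."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    accuracy = sum(t == p for t, p in pairs) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return accuracy, precision, recall

# Hypothetical toy data: two service ratings on a 1-5 scale;
# label 1 = satisfied, 0 = not satisfied.
X_train = [[5, 4], [4, 5], [5, 5], [4, 4], [1, 2], [2, 1], [1, 1], [2, 2]]
y_train = [1, 1, 1, 1, 0, 0, 0, 0]
X_test = [[5, 3], [4, 4], [1, 3], [2, 2]]
y_test = [1, 1, 0, 0]

nb_model = gaussian_nb_fit(X_train, y_train)
nb_pred = [gaussian_nb_predict(nb_model, x) for x in X_test]
knn_pred = [knn_predict(X_train, y_train, x, k=3) for x in X_test]

nb_scores = evaluate(y_test, nb_pred)
knn_scores = evaluate(y_test, knn_pred)
print("Naive Bayes (accuracy, precision, recall):", nb_scores)
print("K-NN        (accuracy, precision, recall):", knn_scores)
```

On real survey data the two methods would generally differ, as they do in the reported results; the toy samples here are deliberately easy to separate, so they say nothing about which method is better.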

How to Cite

Nurdina, A., & Puspita, A. B. I. (2023). Naive Bayes and KNN for Airline Passenger Satisfaction Classification: Comparative Analysis. Journal of Information System Exploration and Research, 1(2). https://doi.org/10.52465/joiser.v1i2.167

Issue

Vol. 1 No. 2 (2023): July 2023

Section

Articles

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
