SVM Optimization with Correlation Feature Selection Based Binary Particle Swarm Optimization for Diagnosis of Chronic Kidney Disease

Main Article Content

Doni Aprilianto

Abstract

Data mining has been widely used to diagnose diseases from medical data. In this study using chronic kidney disease dataset taken from UCI Machine Learning. The dataset has 25 attributes with 400 samples. With 25 attributes that allow redundant data. Redundant data in datasets can reduce computational efficiency and classification accuracy. To increase accuracy of classification algorithm can be done by reducing dimensions of dataset. Correlation-based Feature Selection (CFS) can quickly identify and filter redundant attributes. However, CFS has disadvantage that selected attribute is not necessarily the best attribute. These weaknesses can be overcome by Binary Particle Swarm Optimization (BPSO). BPSO chooses attributes based on the best fitness value. The purpose of this study is to improve accuracy of Support Vector Machine (SVM) by implementing combination of CFS and BPSO as feature selection. Accuracy of SVM in predicting CKD is 63.75%. Whereas, accuracy of SVM by applying CFS as feature selection is 88.75% and average accuracy of ten execution SVM algorithms by applying a combination of CFS and BPSO as feature selection is 95%. Thus, combination of CFS and BPSO as feature selection on the SVM algorithm can improve results of accuracy in diagnosing CKD by 31.25%.

Downloads

Download data is not yet available.

Article Details

How to Cite
[1]
D. Aprilianto, “SVM Optimization with Correlation Feature Selection Based Binary Particle Swarm Optimization for Diagnosis of Chronic Kidney Disease”, J. Soft Comput. Explor., vol. 1, no. 1, pp. 24-31, Oct. 2020.
Section
Articles

References

C. Sreedhar, N. Kasiviswanath, and P. C. Reddy, “Clustering large datasets using K‑means modifed inter and intra clustering (KM‑I2C) in Hadoop, ” J. of Big Data, vol. 27, no. 4, pp. 1-19, 2017.

M.A. Muslim, S. H. Rukmana, E. Sugiharti, B. Prasetiyo, and S. Alimah, “Optimization of C4.5 algorithm-based particle swarm optimization for breast cancer diagnosis, ” presented at the 5th Int. Conf on Mathematics, Science and Education, Bali, Indonesia, Oct. 8–9, 2018.

D. O. Sahin, and E. Kılıc, “Two new feature selection metrics for text classification, ” Automatika, vol.60, no. 2, pp. 162-171, 2019

V. Kotu, and B. Deshpande, Predictive Analytics and Data Mining. Massachusetts, USA: Morgan Kaufmann, 2015, pp. 63-163.

I. Jain, V. K. Jain, and R. Jain, “Correlation feature selection based improved-Binary Particle Swarm Optimization for gene selection and cancer classification, ” Appl. Soft Comp., vol. 62, no.-, pp. 203-215. 2018.

K. Sutha, and J. J. Tamilselvi, “A review of feature selection algorithms for data mining techniques, ” Inte. J. on Comp. Sci. & Eng., vol. 7, no. 6, pp. 63-67, 2015.

P. Yildirim, “Filter based feature selection methods for prediction of risks in hepatitis disease, ” Int. J. of Mach. Lear. & Comp., vol. 5, no. 4, pp. 258-263. 2016.

S. Sasikala, S. Appavu, and S. Geetha, “Multi Filtration Feature Selection (MFFS) to improve discriminatory ability in clinical data set, ” Appl. Comp. & Info., vol. 12. no.-, pp. 117-127. 2017.

I.A. Ashari, M.A. Muslim, and Alamsyah. “Comparison Performance of Genetic Algorithm and Ant Colony Optimization in Course Scheduling Optimizing, ” Sci. J. of Info. vol. 3, no. 2, pp. 149-158, 2016.

M.S. Muhammad, K.V. Selvan, S.M.W. Masra, Z. Ibrahim, and A.F.Z. Abidin, “An Improved Binary Particle Swarm Optimization Algorithm for DNA Encoding Enhancement, ” presented at the IEEE Symposium on Swarm Intelligence, Paris, France, April 11-14, 2011.

W. Abedalkhader, and N. Abdulrahman, “Missing Data Classification of Chronic Kidney Disease, ”, Int. J. of Data Mining & Knowledge Manage. Process, vol. 7, no. 5, pp. 55-61.2017

N. Gopika, and A. M. E. M. Kowshalaya, “Correlation based feature selection algorithm for machine learning, ” presented at the 3rd Int. Conf. on Commu. & Elec. Sys., Coimbatore, India, Oct. 15-18, 2018.

N. D. Jana, and J. Sil, “Interleaving of Particle Swarm Optimization And Differential Evolution Algorithm For Global Optimization, ” Int. J. of Comp. & Appl., vol. 38. no. -, pp. 116-133, 2016.

J. Nayak, B. Naik, and H. S. Behera, “A Comprehensive Survey on Support Vector Machine in Data Mining Tasks: Applications & Challenges, ” Int. J. of Data. Theory & Appl., vol. 8, no.-, pp. 169-186. 2016.

D. K. Srivastava, and L. Bhambhu, “Data classification using support vector machine, ” J. of Theoretical & Appl. Inf. Tech., vol. 12, no.-, pp. 1-7. Feb 2010.

Abstract viewed = 426 times