Optimization of breast cancer classification using feature selection on neural network
Main Article Content
Abstract
Cancer is currently one of the leading causes of death worldwide. One of the most common cancers, especially among women, is breast cancer. There is a major problem for cancer experts in accurately predicting the survival of cancer patients. The presence of machine learning to further study it has attracted a lot of attention in the hope of obtaining accurate results, but its modeling methods and predictive performance remain controversial. Some Methods of machine learning that are widely used to overcome this case of breast cancer prediction are Backpropagation. Backpropagation has an advantage over other Neural Networks, namely Backpropagation using supervised training. The weakness of Backpropagation is that it handles classification with high-dimensional datasets so that the accuracy is low. This study aims to build a classification system for detecting breasts using the Backpropagation method, by adding a method of forward selection for feature selection from the many features that exist in the breast cancer dataset, because not all features can be used in the classification process. The results of combining the Backpropagation method and the method of forward selection can increase the detection accuracy of breast cancer patients by 98.3%.
Downloads
Article Details
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
References
R. H. Saputra and B. Prasetyo, “Improve the Accuracy of C4.5 Algorithm Using Particle Swarm Optimization (PSO) Feature Selection and Bagging Technique in Breast Cancer Diagnosis,” J. Soft Comput. Explor., vol. 1, no. 1, pp. 47–55, 2020, https://doi.org/10.52465/joscex.v1i1.9.
G. Yu, Z. Chen, J. Wu, and Y. Tan, “A diagnostic prediction framework on auxiliary medical system for breast cancer in developing countries,” Knowledge-Based Syst., vol. 232, p. 107459, 2021, doi: 10.1016/j.knosys.2021.107459.
M. Lilleborge, R. S. Falk, T. Sørlie, G. Ursin, and S. Hofvind, “Can breast cancer be stopped? Modifiable risk factors of breast cancer among women with a prior benign or premalignant lesion,” Int. J. Cancer, vol. 149, no. 6, pp. 1247–1256, 2021, doi: 10.1002/ijc.33680.
B. Prasetiyo, Alamsyah, M. A. Muslim, Subhan, and N. Baroroh, “Artificial neural network model for banckrupty prediction,” J. Phys. Conf. Ser., vol. 1567, no. 3, pp. 8–12, 2020, doi: 10.1088/1742-6596/1567/3/032022.
R. B. Dickson and G. M. Stancel, “Estrogen receptor-mediated processes in normal and cancer cells.,” J. Natl. Cancer Inst. Monogr., no. 27, pp. 135–145, 2000, doi: 10.1093/oxfordjournals.jncimonographs.a024237.
A. Binder et al., “Morphological and molecular breast cancer profiling through explainable machine learning,” Nat. Mach. Intell., vol. 3, no. 4, pp. 355–366, 2021, doi: 10.1038/s42256-021-00303-4.
J. Li et al., “Predicting breast cancer 5-year survival using machine learning: A systematic review,” PLoS One, vol. 16, no. 4 April, pp. 1–23, 2021, doi: 10.1371/journal.pone.0250370.
H. Saleh, S. F. Abd-El Ghany, H. Alyami, and W. Alosaimi, “Predicting Breast Cancer Based on Optimized Deep Learning Approach,” Comput. Intell. Neurosci., vol. 2022, 2022, doi: 10.1155/2022/1820777.
I. G. A. Suciningsih, M. A. Hidayat, and R. A. Hapsari, “Comparation analysis of naïve bayes and decision tree C4.5 for caesarean section prediction,” J. Soft Comput. Explor., vol. 2, no. 1, pp. 46–52, 2021, doi: 10.52465/joscex.v2i1.25.
M. Karabatak and M. C. Ince, “An expert system for detection of breast cancer based on association rules and neural network,” Expert Syst. Appl., vol. 36, no. 2 PART 2, pp. 3465–3469, 2009, doi: 10.1016/j.eswa.2008.02.064.
N. Salmi and Z. Rustam, “Naïve Bayes Classifier Models for Predicting the Colon Cancer,” IOP Conf. Ser. Mater. Sci. Eng., vol. 546, no. 5, 2019, doi: 10.1088/1757-899X/546/5/052068.
C. Agossou, M. N. Atchadé, A. Moussa Djibril, and S. V. Kurisheva, “Support Vector Machine, Naive Bayes Classification, and Mathematical Modeling for Public Health Decision-Making: A Case Study of Breast Cancer in Benin,” SN Comput. Sci., vol. 3, no. 2, pp. 1–19, 2022, doi: 10.1007/s42979-021-01008-6.
M. Ibtasam, “Accuracy Measurements and Decision Making by Naïve Bayes and Forward Chaining Method to Identify the Malnutrition Causes and Symptoms,” Sci. J. Informatics, vol. 8, no. 2, pp. 320–324, 2021, doi: 10.15294/sji.v8i2.29317.
M. M. Islam, H. Iqbal, M. R. Haque, and M. K. Hasan, “Prediction of breast cancer using support vector machine and K-Nearest neighbors,” 5th IEEE Reg. 10 Humanit. Technol. Conf. 2017, R10-HTC 2017, vol. 2018-January, pp. 226–229, 2018, doi: 10.1109/R10-HTC.2017.8288944.
L. Tapak, N. Shirmohammadi-Khorram, P. Amini, B. Alafchi, O. Hamidi, and J. Poorolajal, “Prediction of survival and metastasis in breast cancer patients using machine learning classifiers,” Clin. Epidemiol. Glob. Heal., vol. 7, no. 3, pp. 293–299, 2019, doi: 10.1016/j.cegh.2018.10.003.
N. Hidayat, M. F. Al Hakim, and J. Jumanto, “Halal Food Restaurant Classification Based on Restaurant Review in Indonesian Language Using Machine Learning,” Sci. J. Informatics, vol. 8, no. 2, pp. 314–319, 2021, doi: 10.15294/sji.v8i2.33395.
S. Senthil and B. Ayshwarya, “Lung Cancer Prediction using Feed Forward Back Propagation Neural Networks with Optimal Features,” Int. J. Appl. Eng. Res., vol. 13, no. 1, pp. 318–325, 2018, doi:10.37622/000000.
R. Jayapermana, A. Aradea, and N. I. Kurniati, “Implementation of Stacking Ensemble Classifier for Multi-class Classification of COVID-19 Vaccines Topics on Twitter,” Sci. J. Informatics, vol. 9, no. 1, pp. 8–15, 2022, doi: 10.15294/sji.v9i1.31648.
D. I. Wijaya, M. K. Aulia, Jumanto, and M. F. Al Hakim, “Room occupancy classification using multilayer perceptron,” J. Soft Comput. Explor., vol. 2, no. 2, pp. 163–168, 2021, doi: https://doi.org/10.52465/joscex.v2i2.
A. Agustyawan, T. G. Laksana, and U. Athiyah, “Combination of Backpropagation Neural Network and Particle Swarm Optimization for Water Production Prediction in Municipal Waterworks,” Sci. J. Informatics, vol. 9, no. 1, pp. 84–94, 2022, doi: 10.15294/sji.v9i1.29849.
R. S. Wahono, N. S. Herman, and S. Ahmad, “Neural network parameter optimization based on genetic algorithm for software defect prediction,” Adv. Sci. Lett., vol. 20, no. 10–12, pp. 1951–1955, 2014, doi: 10.1166/asl.2014.5641.
N. Nikolaou, N. Edakunni, M. Kull, P. Flach, and G. Brown, “Cost-sensitive boosting algorithms: Do we really need them?,” Mach. Learn., vol. 104, no. 2–3, pp. 359–384, 2016, doi: 10.1007/s10994-016-5572-x.
D. Guan, W. Yuan, Z. Jin, and S. Lee, “Undiagnosed samples aided rough set feature selection for medical data,” Proc. 2012 2nd IEEE Int. Conf. Parallel, Distrib. Grid Comput. PDGC 2012, pp. 639–644, 2012, doi: 10.1109/PDGC.2012.6449895.
H. Meyer, C. Reudenbach, T. Hengl, M. Katurji, and T. Nauss, “Improving performance of spatio-temporal machine learning models using forward feature selection and target-oriented validation,” Environ. Model. Softw., vol. 101, pp. 1–9, 2018, doi: 10.1016/j.envsoft.2017.12.001.