Ensemble learning technique to improve breast cancer classification model
Main Article Content
Abstract
Cancer is a disease characterized by abnormal cell growth and is not contagious, such as breast cancer which can affect both men and women. breast cancer is one of the cancer diseases that is classified as dangerous and takes many victims. However, the biggest problem in this study is that the classification method is low and the resulting accuracy is less than optimal. the purpose of this study is to improve the accuracy of breast cancer classification. Therefore, a new method is proposed, namely ensemble learning which combines logistic regression, decision tree, and random forest methods, with a voting system. This system is useful for finding the best results on each parameter that will produce the best prediction accuracy. The prediction results from this method reached an accuracy of 98.24%. The resulting accuracy rate is more optimal by using the proposed method.
Downloads
Article Details
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
References
E. Rizkyani, N. Aliffiyanti Iskandar, and N. Chamidah, “Klasifikasi dalam Mendeteksi Penyakit Kanker Payudara dengan Menggunakan Metode Random Forest dan Adaboost,” Semin. Nas. Mhs. Ilmu Komput. dan Apl. Jakarta-Indonesia, no. September, pp. 335–343, 2021.
N. Sharma, K. P. Sharma, M. Mangla, and R. Rani, “Breast cancer classification using snapshot ensemble deep learning model and t-distributed stochastic neighbor embedding,” pp. 4011–4029, 2023.
Z. Huang and D. Chen, “A Breast Cancer Diagnosis Method Based on VIM Feature Selection and Hierarchical Clustering Random Forest Algorithm,” IEEE Access, vol. 10, pp. 3284–3293, 2022.
G. Li et al., “Effective Breast Cancer Recognition Based on Fine-Grained Feature Selection,” IEEE Access, vol. 8, 2020.
J. Jumanto, M. F. Mardiansyah, R. Pratama, M. F. Al Hakim, and B. Rawat, “Optimization of breast cancer classification using feature selection on neural network,” J. Soft Comput. Explor., vol. 3, no. 2, pp. 105–110, 2022.
S. Punitha, A. Amuthan, and K. S. Joseph, “Benign and malignant breast cancer segmentation using optimized region growing technique,” Futur. Comput. Informatics J., vol. 3, no. 2, pp. 348–358, Dec. 2018.
A. Nugraheni, R. D. Ramadhani, A. B. Arifa, and A. Prasetiadi, “Perbandingan Performa Antara Algoritma Naive Bayes Dan K-Nearest Neighbour Pada Klasifikasi Kanker Payudara,” J. Dinda Data Sci. Inf. Technol. Data Anal., vol. 2, no. 1, pp. 11–20, 2022.
I. Country-specific, N. Method, and M. Country-specific, “273 523 621,” vol. 858, pp. 2020–2021, 2021.
M. A. Muslim et al., “New model combination meta-learner to improve accuracy prediction P2P lending with stacking ensemble learning,” Intell. Syst. with Appl., vol. 18, p. 200204, May 2023.
H. Chen, N. Wang, X. Du, K. Mei, Y. Zhou, and G. Cai, “Classification Prediction of Breast Cancer Based on Machine Learning,” Comput. Intell. Neurosci., vol. 2023, pp. 1–9, 2023.
L. Khairunnahar, M. A. Hasib, R. H. Bin Rezanur, M. R. Islam, and M. K. Hosain, “Classification of malignant and benign tissue with logistic regression,” Informatics Med. Unlocked, vol. 16, no. May, p. 100189, 2019.
M. J. Abdulaal, M. I. Ibrahem, A. M. Abusorrah, and S. Member, “Real-Time Detection of False Readings in Smart Grid AMI Using Deep and Ensemble Learning,” IEEE Access, vol. 10, pp. 47541–47556, 2022.
S. Nanglia, M. Ahmad, F. Ali Khan, and N. Z. Jhanjhi, “An enhanced Predictive heterogeneous ensemble model for breast cancer prediction,” Biomed. Signal Process. Control, vol. 72, p. 103279, Feb. 2022.
A. S. Assiri, S. Nazir, and S. A. Velastin, “Breast Tumor Classification Using an Ensemble Machine Learning Method,” J. Imaging, vol. 6, no. 6, p. 39, May 2020.
B. Senthilkumar et al., “Ensemble Modelling for Early Breast Cancer Prediction from Diet and Lifestyle,” IFAC-PapersOnLine, vol. 55, no. 1, pp. 429–435, 2022.
F. Gorgan, M. Taher, R. Mohammad, and Z. Kermani, “Decision tree models in predicting water quality parameters of dissolved oxygen and phosphorus in lake water,” Sustain. Water Resour. Manag., vol. 9, no. 1, pp. 1–13, 2023.
M. A. Muslim et al., “An Ensemble Stacking Algorithm to Improve Model Accuracy in Bankruptcy Prediction,” J. Data Sci. Intell. Syst., 2023.
S. Hafeez, S. S. Alotaibi, A. Alazeb, N. A. L. Mudawi, and W. Kim, “Multi-sensor-based Action Monitoring and Recognition via Hybrid Descriptors and Logistic Regression,” IEEE Access, vol. PP, p. 1, 2023.
Z. Khandezamin, M. Naderan, and M. J. Rashti, “Detection and classification of breast cancer using logistic regression feature selection and GMDH classifier,” J. Biomed. Inform., vol. 111, p. 103591, Nov. 2020.
S. Demir and E. K. Sahin, “Comparison of tree-based machine learning algorithms for predicting liquefaction potential using canonical correlation forest , rotation forest , and random forest based on CPT data,” Soil Dyn. Earthq. Eng., vol. 154, no. December 2021, p. 107130, 2022.
M. Minnoor and V. Baths, “Diagnosis of Breast Cancer Using Random Forests,” Procedia Comput. Sci., vol. 218, pp. 429–437, 2023.
B. Dai, R.-C. Chen, S.-Z. Zhu, and W.-W. Zhang, “Using Random Forest Algorithm for Breast Cancer Diagnosis,” in 2018 International Symposium on Computer, Consumer and Control (IS3C), 2018, pp. 449–452.
W. Zhang, H. Li, L. Han, L. Chen, and L. Wang, “Slope stability prediction using ensemble learning techniques: A case study in Yunyang County, Chongqing, China,” J. Rock Mech. Geotech. Eng., vol. 14, no. 4, pp. 1089–1099, Aug. 2022.