Classification of Pancreatic Cancer Diagnosis with CatBoost Using Urine Biomarker Combination

Main Article Content

Yulizchia Malica Pinkan Tanga
Putri Utami
Aditya Yoga Darmawan
Jumanto Unjung

Abstract

Uncontrolled cell growth in the pancreatic gland, is one of the most aggressive types of cancer with a high mortality rate, called pancreatic cancer. This research focuses on improving early diagnosis methods for pancreatic cancer by using CatBoost. Urine biomarker datasets were collected and subjected to pre-processing, including label coding, standardized scaling, and balancing via the Synthetic Minority Oversampling Technique (SMOTE). The CatBoost model achieved an accuracy of 98.89%, specificity of 99.35%, sensitivity of 98.71%, and Area Under the Curve (AUC) of 0.9951. These results show that the CatBoost model significantly outperforms the diagnosis models in previous studies, overcoming the challenges of early detection and classification of pancreatic cancer. This study shows that CatBoost is effective for diagnosing pancreatic cancer and suggests that future research explore other models on larger and more diverse datasets.

Article Details

How to Cite
Tanga, Y. M. P., Utami, P., Darmawan, A. Y., & Unjung, J. . (2026). Classification of Pancreatic Cancer Diagnosis with CatBoost Using Urine Biomarker Combination. Journal of Electronics Technology Exploration, 4(1), 1-9. https://doi.org/10.52465/joetex.v4i1.651
Section
Articles

References

A. Tayanloo-Beik et al., “OMICS insights into cancer histology; Metabolomics and proteomics approach,” Clin. Biochem., vol. 84, no. June, pp. 13–20, 2020.

Rofik, F. Artiyani, D. Ananda, and A. Pertiwi, “Journal of Information System Breast Cancer Diagnosis Utilizing Artificial Neural Network ( ANN ) Algorithm for Integrating Multi-Omics Data and Clinical Features,” vol. 2, no. 2, pp. 61–68, 2024.

Jumanto, M. F. Mardiansyah, R. Pratama, M. F. Al Hakim, and B. Rawat, “Optimization of breast cancer classification using feature selection on neural network,” pp. 105–110, 2022.

A. U. Dullah and F. N. Apsari, “Ensemble learning technique to improve breast cancer classification model,” pp. 107–113, 2023.

N. P. Damayanti, M. Nabiel, D. Ananda, and F. W. Nugraha, “Lung cancer classification using convolutional neural network and DenseNet,” pp. 133–141, 2023.

Z. Y. Zhao and W. Liu, “Pancreatic Cancer: A Review of Risk Factors, Diagnosis, and Treatment,” Technol. Cancer Res. Treat., vol. 19, pp. 1–13, 2020.

K. C. Ravi, R. R. Dixit, T. Indhumathi, S. Singh, A. Gopatoti, and A. S. Yadav, “AI-Powered Pancreas Navigator: Delving into the Depths of Early Pancreatic Cancer Diagnosis using Advanced Deep Learning Techniques,” IEEE 9th Int. Conf. Smart Struct. Syst. ICSSS 2023, pp. 1–6, 2023.

S. Debernardi et al., “A combination of urinary biomarker panel and PancRISK score for earlier detection of pancreatic cancer: A case-control study,” PLoS Med., vol. 17, no. 12, pp. 1–23, 2020.

A. Chakraborty and C. P. Tsokos, “An AI-driven Predictive Model for Pancreatic Cancer Patients Using Extreme Gradient Boosting,” J. Stat. Theory Appl., vol. 22, no. 4, pp. 262–282, 2023.

A. Gupta, A. Koul, and Y. Kumar, “Pancreatic Cancer Detection using Machine and Deep Learning Techniques,” Proc. 2nd Int. Conf. Innov. Pract. Technol. Manag. ICIPTM 2022, vol. 2, pp. 151–155, 2022.

D. Placido et al., “A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories,” Nat. Med., vol. 29, no. 5, pp. 1113–1122, 2023.

İ. Acer, F. Orhanbulucu, S. Içer, and F. Latifoglu, “Early diagnosis of pancreatic cancer by machine learning methods using urine biomarker combinations,” Turkish J. Electr. Eng. Comput. Sci., vol. 31, no. 1, pp. 112–125, 2023.

K. Cao et al., “Large-scale pancreatic cancer detection via non-contrast CT and deep learning,” Nat. Med., vol. 29, no. 12, pp. 3033–3043, 2023.

M. E. Karar, N. El-Fishawy, and M. Radad, “Automated classification of urine biomarkers to diagnose pancreatic cancer using 1-D convolutional neural networks,” J. Biol. Eng., vol. 17, no. 1, pp. 1–12, 2023.

M. Dbouk et al., “The Multicenter Cancer of Pancreas Screening Study: Impact on Stage and Survival,” J. Clin. Oncol., vol. 4, no. 28, 2022.

M. Goggins, “The role of biomarkers in the early detection of pancreatic cancer,” Fam. Cancer, vol. 23, no. 3, pp. 309–322, 2024.

M. Heyen, “Diagnosing Pancreatic Cancer via Urinary Biomarkers,” no. 11, 2023.

O. Blyuss et al., “Development of PancRISK, a urine biomarker-based risk score for stratified screening of pancreatic cancer patients,” Br. J. Cancer, vol. 122, no. 5, pp. 692–696, 2020.

C. S. Kulkarni, “Advancing Gradient Boosting: A Comprehensive Evaluation of the CatBoost Algorithm for Predictive Modeling,” J. Artif. Intell. Mach. Learn. Data Sci., vol. 1, no. 5, pp. 54–57, 2022.

S. Ben Jabeur, C. Gharib, S. Mefteh-Wali, and W. Ben Arfi, “CatBoost model and artificial intelligence techniques for corporate failure prediction,” Technol. Forecast. Soc. Change, vol. 166, no. January, p. 120658, 2021.

A. M. Wyglinski, D. P. Orofino, M. N. Ettus, and T. W. Rondeau, “Revolutionizing software defined radio: case studies in hardware, software, and education,” IEEE Commun. Mag., vol. 54, no. 1, pp. 68–75, Jan. 2016.

J. T. Hancock and T. M. Khoshgoftaar, “Survey on categorical data for neural networks,” J. Big Data, vol. 7, no. 1, 2020.

W. A. Arifin, I. Ariawan, A. A. Rosalia, L. Lukman, and N. Tufailah, “Data scaling performance on various machine learning algorithms to identify abalone sex,” J. Teknol. dan Sist. Komput., vol. 10, no. 1, pp. 26–31, 2022.

B. Deepa and K. Ramesh, “Epileptic seizure detection using deep learning through min max scaler normalization,” Int. J. Health Sci. (Qassim)., vol. 6, no. April, pp. 10981–10996, 2022.

E. Boonchieng, V. Chouvatut, and D. Lipsky, “Article FLEX-SMOTE : Synthetic over-sampling technique that flexibly adjusts to different minority class distributions FLEX-SMOTE : Synthetic over-sampling technique that flexibly adjusts to different minority class distributions,” Patterns, p. 101073, 2024.

G. A. Pradipta, R. Wardoyo, A. Musdholifah, I. N. H. Sanjaya, and M. Ismail, “SMOTE for Handling Imbalanced Data Problem : A Review,” 2021 6th Int. Conf. Informatics Comput. ICIC 2021, pp. 1–8, 2021.

H. V Nerlekar, K. Moholkar, D. Wagh, and S. L. Bangare, “Integrating AI with Health Informatics for Early Detection of Pancreatic Cancer Frontiers in Health Informatics,” vol. 13, no. 2, pp. 204–217, 2024.

A. V. Dorogush, V. Ershov, and A. Gulin, “CatBoost : gradient boosting with categorical features support,” pp. 1–7.

N. Khodadadi, H. Roghani, F. De Caso, E. M. El-kenawy, Y. Yesha, and A. Nanni, “Thin-Walled Structures Data-driven PSO-CatBoost machine learning model to predict the compressive strength of CFRP- confined circular concrete specimens,” Thin-Walled Struct., vol. 198, no. February, p. 111763, 2024.

M. Grandini, E. Bagli, and G. Visani, “Metrics for Multi-Class Classification: an Overview,” pp. 1–17, 2020.

Abstract viewed = 0 times