Analysis of the Stacking Ensemble Learning Model of Categorical Boosting and Naïve Bayes Algorithms for Crop Selection Based on Soil Characteristics
Main Article Content
Abstract
This study aims to develop a machine learning model for selecting crop types based on soil characteristics, using the Categorical Boosting and Naïve Bayes algorithms as base learners. Next, an ensemble learning technique using a stacking approach was applied to improve the performance of the base model that was built. This was done to analyze and compare the performance results of each ensemble model that was carried out. Model performance was evaluated using evaluation metrics including precision, recall, f1-score, and accuracy. The results of this study indicate that the stacking ensemble model with Random Forest as the meta learner can provide better performance compared to other ensemble models. This model achieved a precision of 98.85337%, a recall of 99.84848%, an F1-score of 99.84844%, an accuracy of 99.84848%, and a model training time of 78.61110 seconds. Based on these results, this study is expected to provide tangible contributions and new knowledge in plant selection classification based on soil characteristics, thereby aiding in the precise and efficient determination of suitable plant types.
Article Details

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
References
K. P. R. Indonesia, “Keputusan Menteri Pertanian Republik Indonesia Nomor 484/KPTS/RC.020/M/8/2021 Tentang Perubahan Kedua Atas Keputusan Menteri Pertanian Nomor 259/KTPS/RC.020/M/05/2020 Tentang Rencana Strategis Kementrian Pertanian Tahun 2020-2024,” Jakarta, 2021. [Online]. Available: https://bpmsph.ditjenpkh.pertanian.go.id/wp-content/uploads/2023/10/RENSTRA-KEMENTAN-2020-2024-REVISI-2-26-Agt-2021.pdf
A. F. Negarawan, M. U. Siregar, A. Fatwanto, and M. D. R. Wahyudi, “An Implementation of Profile Matching Method to Determine Agricultural Crops that Suit the Land,” Proc. Int. Conf. Sci. Eng. (ICSE-UIN-SUKA 2021), vol. 211, pp. 124–129, 2022, doi: 10.2991/aer.k.211222.020.
Mahmudi, I. Sasli, and T. H. Ramadhan, “Growth and yield of rice from mycorrhizal enrichment seedlings on different soil water content,” Indones. J. Agron., vol. 51, no. 2, pp. 173–180, 2023, doi: 10.24831/ija.v51i2.46201.
B. Mahesh, “Machine Learning Algorithms - A Review,” Int. J. Sci. Res., vol. 9, no. 1, pp. 381–386, 2020, doi: 10.21275/art20203995.
Q. Liu and Y. Wu, “Encyclopedia of the Sciences of Learning,” Encycl. Sci. Learn., no. January 2012, 2012, doi: 10.1007/978-1-4419-1428-6.
P. J. B. Pajila, B. G. Sheena, A. Gayathri, J. Aswini, M. Nalini, and R. Siva Subramanian, “A Comprehensive Survey on Naive Bayes Algorithm: Advantages, Limitations and Applications,” Proc. 4th Int. Conf. Smart Electron. Commun. ICOSEC 2023, no. September 2023, pp. 1228–1234, 2023, doi: 10.1109/ICOSEC58147.2023.10276274.
R. Blanquero, E. Carrizosa, P. Ramírez-Cobo, and M. R. Sillero-Denamiel, “Variable selection for Naïve Bayes classification,” Comput. Oper. Res., vol. 135, p. 105456, 2021, doi: 10.1016/j.cor.2021.105456.
W. Chang, X. Wang, J. Yang, and T. Qin, “An Improved CatBoost-Based Classification Model for Ecological Suitability of Blueberries,” Sensors, vol. 23, no. 4, 2023, doi: 10.3390/s23041811.
C. Yan, X. Zhang, and J. Shen, “Credit Score Classification Using Advanced Machine Learning : A Comprehensive Approach,” vol. 18, no. 3, pp. 98–112, 2025, doi: 10.4236/jsea.2025.183007.
A. A. Ibrahim, R. L. Ridwan, M. M. Muhammed, R. O. Abdulaziz, and G. A. Saheed, “Comparison of the CatBoost Classifier with other Machine Learning Methods,” Int. J. Adv. Comput. Sci. Appl., vol. 11, no. 11, pp. 738–748, 2020, doi: 10.14569/IJACSA.2020.0111190.
M. R. P. Putra and E. Utami, “Comparative Analysis of Hybrid Model Performance Using Stacking and Blending Techniques for Student Drop Out Prediction In MOOC,” J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 8, no. 3, pp. 346–354, 2024, doi: 10.29207/resti.v8i3.5760.
D. Dahiphale, P. Shinde, K. Patil, and V. Dahiphale, “Smart Farming: Crop Recommendation using Machine Learning with Challenges and Future Ideas,” 2023, doi: 10.36227/techrxiv.23504496.v1.
S. Sultana, T. Hasan, F. I. Tanjil, S. Islam, M. O. Rahman, and N. R. Chakraborty, “An IoT and Hybrid Machine Learning Based Real-time Web Framework for Optimizing Crop Recommendation,” CAMA Cent. Appl. Macroecon. Anal., vol. 13, no. January, pp. 909–920, 2023, doi: 10.2139/ssrn.5132627.
C. Vyapari, P. Bhosale, and C. Vyapari Prathamesh Bhosale Ameya Parkar, “Comparative Analysis of Machine Learning Algorithms for Crop Recommendations,” pp. 0–4, 2023, [Online]. Available: https://doi.org/10.21203/rs.3.rs-3181672/v1
P. Mahesh and R. Soundrapandiyan, “Yield prediction for crops by gradient-based algorithms,” PLoS One, vol. 19, no. 8, pp. 1–20, 2024, doi: 10.1371/journal.pone.0291928.
H. Afzal et al., “Incorporating soil information with machine learning for crop recommendation to improve agricultural output,” Sci. Rep., vol. 15, no. 1, pp. 1–15, 2025, doi: 10.1038/s41598-025-88676-z.
H. Wang and L. Cheng, “C AT B OOST MODEL WITH SYNTHETIC FEATURES IN APPLICATION TO LOAN RISK ASSESSMENT OF SMALL,” 2021.
S. Hussain, M. Wazir, T. A. Jumani, and S. Khan, “A novel feature engineered-CatBoost-based supervised machine learning framework for electricity theft detection,” Energy Reports, vol. 7, pp. 4425–4436, 2021, doi: 10.1016/j.egyr.2021.07.008.
L. Prokhorenkova, G. Gusev, A. Vorobev, A. V. Dorogush, and A. Gulin, “CatBoost : unbiased boosting with categorical features,” no. Section 4, pp. 1–23, 2019.
R. Taherdangkoo et al., “Modeling unsaturated hydraulic conductivity of compacted bentonite using a constrained CatBoost with bootstrap analysis,” Appl. Clay Sci., vol. 260, no. March, p. 107530, 2024, doi: 10.1016/j.clay.2024.107530.
X. Huang, W. Liu, Q. Guo, and J. Tan, “Prediction method for the dynamic response of expressway lateritic soil subgrades on the basis of Bayesian optimization CatBoost,” Soil Dyn. Earthq. Eng., vol. 186, no. September, p. 108943, 2024, doi: 10.1016/j.soildyn.2024.108943.
H. Chen, S. Hu, R. Hua, and X. Zhao, “Improved naive Bayes classification algorithm for traffic risk management,” EURASIP J. Adv. Signal Process., vol. 2021, no. 1, 2021, doi: 10.1186/s13634-021-00742-6.
J. S. A. Ruiz, C. Romero, and A. Cicconardi, “XNB: Explainable Class-Specific NaIve-Bayes Classifier,” 2024, [Online]. Available: http://arxiv.org/abs/2411.01203
T. Miquélez, E. Bengoetxea, and P. Larrañaga, “Evolutionary computation based on Bayesian classifiers,” Int. J. Appl. Math. Comput. Sci., vol. 14, no. 3, pp. 335–349, 2004, [Online]. Available: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.110.3623&rep=rep1&type=pdf
L. Wu et al., “Prediction model for the compressive strength of rock based on stacking ensemble learning and shapley additive explanations,” vol. 123, 2024, doi: 10.1007/s10064-024-03896-3.
Q. Li and X. Wang, “Bayesian optimization of stacking ensemble learning model for HPC compressive strength prediction,” Expert Syst. Appl., vol. 288, no. May, p. 128281, 2025, doi: 10.1016/j.eswa.2025.128281.
A. Ingle, “Crop Recommendation Dataset.” Accessed: Jan. 13, 2025. [Online]. Available: https://www.kaggle.com/datasets/atharvaingle/crop-recommendation-dataset
M. Vakili, M. Ghamsari, and M. Rezaei, “Performance Analysis and Comparison of Machine and Deep Learning Algorithms for IoT Data Classification,” no. January, 2020, doi: 10.48550/arXiv.2001.09636.
A. Mohammed and R. Kora, “A comprehensive review on ensemble deep learning: Opportunities and challenges,” J. King Saud Univ. - Comput. Inf. Sci., vol. 35, no. 2, pp. 757–774, 2023, doi: 10.1016/j.jksuci.2023.01.014.
D. M. W. Powers, “Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation,” pp. 37–63, 2020, [Online]. Available: http://arxiv.org/abs/2010.16061
I. Markoulidakis and G. Markoulidakis, “Probabilistic Confusion Matrix: A Novel Method for Machine Learning Algorithm Generalized Performance Analysis,” Technologies, vol. 12, no. 7, 2024, doi: 10.3390/technologies12070113.
N. N. Thilakarathne, M. S. A. Bakar, P. E. Abas, and H. Yassin, “A Cloud Enabled Crop Recommendation Platform for Machine Learning-Driven Precision Farming,” 2022. doi: 10.3390/s22166299.
K. Bakthavatchalam et al., “IoT Framework for Measurement and Precision Agriculture: Predicting the Crop Using Machine Learning Algorithms,” Technologies, vol. 10, no. 1, 2022, doi: 10.3390/technologies10010013.
S. Chunduri, H. Raj, and V. G. Narendra, “Intelligent Systems for Crop Recommendation using Machine Learning,” WSEAS Trans. Comput., vol. 24, pp. 14–19, 2025, doi: 10.37394/23205.2025.24.2.
R. K. Banoth and R. B. V. Murthy, “Automatic Crop Recommendation System Using LightGBM and Decision Tree Machine Learning Models,” J. Mach. Comput., vol. 5, no. 1, pp. 343–355, 2025, doi: 10.53759/7669/jmc202505026.
M. Bouni, B. Hssina, K. Douzi, and S. Douzi, “Interpretable Machine Learning Techniques for an Advanced Crop Recommendation Model,” J. Electr. Comput. Eng., vol. 2024, 2024, doi: 10.1155/2024/7405217.