Hoax classification in indonesian language with bidirectional temporal convolutional network architecture
Main Article Content
Abstract
The increasingly massive rate of information dissemination in cyberspace has had several negative impacts, one of which is the increased vulnerability to the spread of hoaxes. Hoax has seven classifications. Classification problems such as hoax classification can be automated using the application of the Deep Learning model. Bidirectional Temporal Convolutional Network (Bi-TCN) is a type of Deep Learning architectural model that is very suitable for text classification cases. Because the architecture uses dilation factors in its feature extraction so it can generate exceptionally large receptive fields and is supported by Bidirectional aggregation to ensure that the model can learn long-term dependencies without storing duplicate context information. The purpose of this study is to evaluate the performance of Bi-TCN architecture combined with pre-trained FastText embedding model for hoax classification in Indonesian and implement the resulting model on website. Based on the research that has been done, the model with Bi-TCN architecture has satisfactory performance with an accuracy score of 92.98% and a loss value that can be reduced to 0.191. Out of a total of 13,673 data tested with this model, only 414 data or in other words around 3% of the total data were incorrect predictions.
Downloads
Article Details
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
References
A. Yuliani, “Ada 800.000 Situs Penyebar Hoax di Indonesia,” Kominfo RI, 2017. https://www.kominfo.go.id/content/detail/12008/ada-800000-situs-penyebar-hoax-di-indonesia/0/sorotan_media (accessed May 09, 2021).
Badan Penelitian dan Pengembangan, “Riset: 44 Persen Orang Indonesia Belum Bisa Mendeteksi Berita Hoax,” Kemendagri RI, 2018. https://litbang.kemendagri.go.id/website/riset-44-persen-orang-indonesia-belum-bisa-mendeteksi-berita-hoax-2/ (accessed Dec. 28, 2022).
C. Wardle, “Fake news. It’s complicated.,” First Draft, 2017. https://firstdraftnews.org/articles/fake-news-complicated/ (accessed May 09, 2021).
C. Khontoro, J. Andjarwirawan, and Yulia, “Penerapan Algoritma TextRank dan Dice Similarity Untuk Verifikasi Berita Hoax,” Jurnal Infra, vol. 9, no. 1, pp. 98–102, 2021.
J. P. Haumahu, S. D. H. Permana, and Y. Yaddarabullah, “Fake news classification for Indonesian news using Extreme Gradient Boosting (XGBoost),” IOP Conf Ser Mater Sci Eng, vol. 1098, no. 5, p. 052081, Mar. 2021, doi: 10.1088/1757-899x/1098/5/052081.
B. P. Nayoga, R. Adipradana, R. Suryadi, and D. Suhartono, “Hoax Analyzer for Indonesian News Using Deep Learning Models,” in Procedia Computer Science, 2021, vol. 179, pp. 704–712. doi: 10.1016/j.procs.2021.01.059.
T. Trisna Astono Putri, H. S. Warra, I. Yanti Sitepu, and M. Sihombing, “Analysis and Detection of Hoax Contents in Indonesian News Based on Machine Learning,” Journal Of Informatics Pelita Nusantara, vol. 4, no. 1, 2019.
H. Mustofa and A. A. Mahfudh, “Klasifikasi Berita Hoax Dengan Menggunakan Metode Naive Bayes,” Walisongo Journal of Information Technology, vol. 1, no. 1, p. 1, Nov. 2019, doi: 10.21580/wjit.2019.1.1.3915.
I. Y. R. Pratiwi, R. A. Asmara, and F. Rahutomo, “Study of Hoax News Detection using Naïve Bayes Classifier in Indonesian Language,” in Proceedings of the 11th International Conference on Information and Communication Technology and System, ICTS 2017, Jan. 2018, vol. 2018-January, pp. 73–78. doi: 10.1109/ICTS.2017.8265649.
E. Zuliarso, M. T. Anwar, K. Hadiono, and I. Chasanah, “Detecting Hoaxes in Indonesian News Using TF/TDM and K Nearest Neighbor,” in IOP Conference Series: Materials Science and Engineering, May 2020, vol. 835, no. 1. doi: 10.1088/1757-899X/835/1/012036.
A. B. Prasetijo, R. R. Isnanto, D. Eridani, Y. A. A. Soetrisno, M. Arfan, and A. Sofwan, “Hoax Detection System on Indonesian News Sites Based on Text Classification using SVM and SGD,” in International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE), Oct. 2017, pp. 45–49.
A. A. Kurniawan and M. Mustikasari, “Implementasi Deep Learning Menggunakan Metode CNN dan LSTM untuk Menentukan Berita Palsu dalam Bahasa Indonesia,” Jurnal Informatika Universitas Pamulang, vol. 5, no. 4, pp. 544–552, Oct. 2020, doi: 10.32493/informatika.v5i4.7760.
R. Adipradana, B. P. Nayoga, R. Suryadi, and D. Suhartono, “Hoax Analyzer for Indonesian News using RNNs with Fasttext and Glove Embeddings,” Bulletin of Electrical Engineering and Informatics, vol. 10, no. 4, pp. 2130–2136, Aug. 2021, doi: 10.11591/eei.v10i4.2956.
A. Apriliyanto and R. Kusumaningrum, “Hoax Detection in Indonesian Language using Long Short-Term Memory Model,” Sinergi, vol. 24, no. 3, pp. 189–196, Jul. 2020, doi: 10.22441/sinergi.2020.3.003.
S. Bai, J. Z. Kolter, and V. Koltun, “An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling,” Mar. 2018, [Online]. Available: http://arxiv.org/abs/1803.01271
L. Nanni, A. Lumini, A. Manfè, R. Rampon, S. Brahnam, and G. Venturin, “Gated Recurrent Units and Temporal Convolutional Network for Multilabel Classification,” 2021.
K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 770–778. [Online]. Available: http://image-net.org/challenges/LSVRC/2015/
C. Zhong, L. Jiang, Y. Liang, H. Sun, and C. Ma, “Temporal Multiple-convolutional Network for Commodity Classification of Online Retail Platform Data,” in ACM International Conference Proceeding Series, Feb. 2020, pp. 236–241. doi: 10.1145/3383972.3383989.
Y. Liang, J. Kang, Z. Yu, B. Guo, X. Zheng, and S. He, “Leverage Temporal Convolutional Network for theRepresentation Learning of URLs,” in 2019 IEEE International Conference on Intelligence and Security Informatics (ISI), Jul. 2019, pp. 74–79.
J. Sun, X. Luo, H. Gao, W. Wang, Y. Gao, and X. Yang, “Categorizing Malware via A Word2Vec-based Temporal Convolutional Network Scheme,” Journal of Cloud Computing, vol. 9, no. 1, Dec. 2020, doi: 10.1186/s13677-020-00200-y.
Y. Zuo et al., “Short Text Classification Based on Bidirectional TCN and Attention Mechanism,” in Journal of Physics: Conference Series, Dec. 2020, vol. 1693, no. 1. doi: 10.1088/1742-6596/1693/1/012067.
B. Jang, M. Kim, G. Harerimana, S. U. Kang, and J. W. Kim, “Bi-LSTM Model to Increase Accuracy in Text Classification: Combining Word2vec CNN and Attention Mechanism,” Applied Sciences (Switzerland), vol. 10, no. 17, pp. 5841–5854, Sep. 2020, doi: 10.3390/app10175841.
J. Jumanto, M. A. Muslim, Y. Dasril, and T. Mustaqim, “Accuracy of Malaysia Public Response to Economic Factors During the Covid-19 Pandemic Using Vader and Random Forest,” Journal of Information System Exploration and Research, vol. 01, no. 01, pp. 49–70, 2023, doi: 10.00000/joiser.0000.00.00.000.
K. He, Y. Yan, and W. Xu, “Learning to Tag OOV Tokens by Integrating Contextual Representation and Background Knowledge,” in Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Jul. 2020, pp. 619–624.
M. Heidarysafa, K. Kowsari, D. E. Brown, K. J. Meimandi, and L. E. Barnes, “An Improvement of Data Classification using Random Multimodel Deep Learning (RMDL),” Int J Mach Learn Comput, vol. 8, no. 4, pp. 298–310, Aug. 2018, doi: 10.18178/ijmlc.2018.8.4.703.
O. Calin, Deep Learning Architectures. Cham: Springer Nature Switzerland AG, 2020. doi: 10.1007/978-3-030-36721-3.
A. D. Lestari, N. A. Syarifudin, and Y. J. Nurriski, “Application of pest detection on vegetable crops using the cnn algorithm as a smart farm innovation to realize food security in the 4.0 era,” Journal of Soft Computing Exploration, vol. 3, no. 2, Sep. 2022, doi: 10.52465/joscex.v3i2.72.
H. Aji Prihanditya and N. Hestu Aji Prihanditya, “The Implementation of Z-Score Normalization and Boosting Techniques to Increase Accuracy of C4.5 Algorithm in Diagnosing Chronic Kidney Disease,” Journal of Soft Computing Exploration, vol. 1, no. 1, pp. 63–69, 2020.
H. El-Amir and M. Hamdy, Deep Learning Pipeline. New York: Apress Media LLC, 2020. doi: 10.1007/978-1-4842-5349-6.
B. Xiao, Y. Liu, and B. Xiao, “Accurate State-of-charge Estimation Approach for Lithium-ion Batteries by Gated Recurrent Unit with Ensemble Optimizer,” IEEE Access, vol. 7, pp. 54192–54202, 2019, doi: 10.1109/ACCESS.2019.2913078.
A. Vieira and B. Ribeiro, Introduction to Deep Learning Business Applications for Developers: From Conversational Bots in Customer Service to Medical Image Processing. New York: Apress Media LLC, 2018. doi: 10.1007/978-1-4842-3453-2.
C. Zong, R. Xia, and J. Zhang, Text Data Mining. Singapore: Tsinghua University Press, 2021. doi: 10.1007/978-981-16-0100-2.
S. Yamaguchi et al., “Web Services for Collaboration Analysis with IoT Badges,” IEEE Access, vol. 10, pp. 121318–121328, 2022, doi: 10.1109/ACCESS.2022.3222562.
A. M. Potdar, D. G. Narayan, S. Kengond, and M. M. Mulla, “Performance Evaluation of Docker Container and Virtual Machine,” Procedia Comput Sci, vol. 171, pp. 1419–1428, 2020, doi: 10.1016/j.procs.2020.04.152.
Heroku, “Deploying with Docker | Heroku Dev Center.” https://devcenter.heroku.com/categories/deploying-with-docker (accessed Jan. 18, 2023).
N. Li and B. Zhang, “The Research on Single Page Application Front-end Development Based on Vue,” J Phys Conf Ser, vol. 1883, no. 1, Apr. 2021, doi: 10.1088/1742-6596/1883/1/012030.
A. Safonyk, M. Mishchanchuk, V. I. Lytvynenko, and V. Lytvynenko, “Intelligent Information System for The Determination of Iron in Coagulantsbased on a Neural Network,” in 2nd International Workshop on Intelligent Information Technologies and Systems of Information Security (IntelITSIS), Mar. 2021, pp. 142–150. [Online]. Available: https://www.researchgate.net/publication/351844460
Y. Quan, “Design and Implementation of E-commerce Platform based on Vue.js and MySQL,” 3rd International Conference on Computer Engineering, Information Science & Application Technology (ICCIA 2019), vol. 90, pp. 449–454, 2019.
Y. Bao, C. Zhang, and Q. Shi, “Mining and Analysis Based on Big Data in Public Transportation,” 11th International Symposium on Intelligence Computation and Applications (ISICA 2019), vol. 1205, pp. 681–688, 2020, doi: 10.1007/978-981-15-5577-0.
E. Wohlgethan, “Supporting Web Development Decisions by Comparing Three Major JavaScript Frameworks: Angular, React, and Vuejs,” PhD Thesis, Hamburg University of Applied Sciences, Hamburg, 2018.
I. Maramba, A. Chatterjee, and C. Newman, “Methods of Usability Testing in The Development of eHealth Applications: A Scoping Review,” Int J Med Inform, vol. 126, pp. 95–104, Jun. 2019, doi: 10.1016/j.ijmedinf.2019.03.018.
C. Juditha and J. J. Darmawan, “Infodemik Di Masa Pandemi: Analisis Peta Hoaks Covid-19 Tahun 2020,” Pekommas, vol. 6, no. The, pp. 66–67, 2020, doi: 10.30818/jpkm.2021.2060307.