YouTube Comment Sentiment Analysis on Deddy Corbuzier and BEM UI’s Podcast Using TF-IDF and Naïve Bayes

Main Article Content

Muhammad Rafi Al Basyary

Abstract

The development of technology and the popularity of digital platforms such as Youtube have had a significant impact on the dissemination of information, especially related to the idea of a presidential candidate in the 2024 election. For example, Deddy Corbuzier actively uploads a podcast video on his Youtube channel, together with the chairman of BEM UI, to discuss the ideas of each presidential candidate. In this study, sentiment analysis was carried out on more than 2.6 million Youtube user comments on the video using the Naïve Bayes Classifier algorithm. This algorithm has proven effective in previous studies, showing high accuracy in classifying people's sentiments. The research methodology includes data labeling, text preprocessing, word weighting with TF-IDF, data validation using k-fold cross validation, and data testing. The results of the sentiment analysis revealed that more than fifty percent of the comments were positive, while some remained neutral. The data visualization process using word cloud provides a clear picture of the topics most talked about by the public, with the word "Leader" dominating. Although the accuracy of the Naïve Bayes Classifier in this study reached 57.6%, this study provides valuable insights into the public's view of the idea of a presidential candidate. Further development may involve the use of other algorithms as comparators to improve the accuracy of sentiment analysis.

Article Details

How to Cite
Al Basyary, M. R. . (2024). YouTube Comment Sentiment Analysis on Deddy Corbuzier and BEM UI’s Podcast Using TF-IDF and Naïve Bayes. SAINTEKBU, 16(02), 10–17. Retrieved from https://ejournal.unwaha.ac.id/index.php/saintek/article/view/4352

References

  1. Aziz, A., & Fauziah, F. (2022). Analisis Sentimen Identifikasi Opini Terhadap Produk, Layanan dan Kebijakan Perusahaan Menggunakan Algoritma TF-IDF dan SentiStrength. J-SAKTI (Jurnal Sains Komputer Dan Informatika), 6(1), 115–125.
  2. Baskoro, B. B., Susanto, I., & Khomsah, S. (2021). Analisis Sentimen Pelanggan Hotel di Purwokerto Menggunakan Metode Random Forest dan TF-IDF (Studi Kasus: Ulasan Pelanggan Pada Situs TRIPADVISOR). INISTA (Journal of Informatics Information System Software Engineering and Applications), 3(2), 21–29.
  3. Gifari, O. I., Adha, Muh., Freddy, F., & Durrand, F. F. S. (2022). Analisis Sentimen Review Film Menggunakan TF-IDF dan Support Vector Machine. Journal of Information Technology, 2(1), 36–40. https://doi.org/10.46229/jifotech.v2i1.330
  4. Kosasih, R., & Alberto, A. (2021). Analisis Sentimen Produk Permainan Menggunakan Metode TF-IDF Dan Algoritma K-Nearest Neighbor. InfoTekJar: Jurnal Nasional Informatika Dan Teknologi Jaringan, 6(1), 134–139.
  5. KPU. (2023, December 13). KPU Tetapkan Tiga Pasangan Calon Presiden dan Wakil Presiden Pemilu 2024. https://www.kpu.go.id/berita/baca/12081/kpu-tetapkan-tiga-pasangan-calon-presiden-dan-wakil-presiden-pemilu-2024#:~:text=Selaku%20Ketua%20Divisi%20Teknis%20Penyelenggaraan%2C%20Idham%20menyampaikan%2C%20KPU,Ganjar%20Pranowo-Mahfud%20MD%2C%20serta%20Prabowo%20Subianto-Gibran%20Rakabuming%20Raka.
  6. Merinda Lestandy, Abdurrahim Abdurrahim, & Lailis Syafa’ah. (2021). Analisis Sentimen Tweet Vaksin COVID-19 Menggunakan Recurrent Neural Network dan Naïve Bayes. Jurnal RESTI (Rekayasa Sistem Dan Teknologi Informasi), 5(4), 802–808. https://doi.org/10.29207/resti.v5i4.3308
  7. Sari, F. V., & Wibowo, A. (2019). Analisis sentimen pelanggan toko online Jd. Id menggunakan metode Naïve Bayes Classifier berbasis konversi ikon emosi. Simetris: Jurnal Teknik Mesin, Elektro Dan Ilmu Komputer, 10(2), 681–686.
  8. Septian, J. A., Fachrudin, T. M., & Nugroho, A. (2019). Analisis Sentimen Pengguna Twitter Terhadap Polemik Persepakbolaan Indonesia Menggunakan Pembobotan TF-IDF dan K-Nearest Neighbor. Journal of Intelligent System and Computation, 1(1), 43–49. https://doi.org/10.52985/insyst.v1i1.36
  9. Silalahi, N., & Ginting, G. L. (2022). Analisa Sentimen Masyarakat Dalam Penggunaan Vaksin Sinovac Dengan Menerapkan Algoritma Term Frequence – Inverse Document Frequence (TF-IDF) dan Metode Deskripsi. Journal of Information System Research (JOSH), 3(3), 206–217. https://doi.org/10.47065/josh.v3i3.1441
  10. Tupari, T., Abdullah, S., & Chairani, C. (2023). Visualisasi Data Analisa Sentimen RUU Omnibus Law Kesehatan Menggunakan KNN dengan Software RapidMiner. Jurnal Informatika: Jurnal Pengembangan IT, 8(3), 261–268. https://doi.org/10.30591/jpit.v8i3.5641
  11. Yutika, C. H., Adiwijaya, A., & Faraby, S. Al. (2021). Analisis Sentimen Berbasis Aspek pada Review Female Daily Menggunakan TF-IDF dan Naïve Bayes. JURNAL MEDIA INFORMATIKA BUDIDARMA, 5(2), 422. https://doi.org/10.30865/mib.v5i2.2845
  12. Zhafira, D. F., Rahayudi, B., & Indriati, I. (2021). Analisis Sentimen Kebijakan Kampus Merdeka Menggunakan Naive Bayes dan Pembobotan TF-IDF Berdasarkan Komentar pada Youtube. Jurnal Sistem Informasi, Teknologi Informasi, Dan Edukasi Sistem Informasi, 2(1). https://doi.org/10.25126/justsi.v2i1.24