Topic modelling analysis of public policy narratives on prabowo-gibran in national news
Main Article Content
Abstract
The rapid acceleration of digital transition has become an inevitable reality of the modern era. The proliferation of online communication platforms, news portals, and heterogeneous data formats has substantially increased big data volumes, leading to large-scale collections of unstructured data. This study aims to analyze dominant public policy–related topics concerning the Prabowo–Gibran administration by applying topic modeling techniques to national online news media. Latent Dirichlet Allocation (LDA) and Non-negative Matrix Factorization (NMF) were employed as unsupervised learning approaches to extract latent semantic structure from a corpus of 200 credible news articles collected through URL fetching using Python 3. Data preprocessing included text cleaning, tokenization, bigram and trigram construction, and the development of a dictionary and corpus. Model performance was evaluated using topic coherence metrics, yielding scores of 0.3709 for LDA and 0.68 for NMF. To examine temporal dynamics, the dataset was divided based on the official inauguration date of the president and vice president, enabling a comparative analysis of dominant topics before and after the inauguration. Topic similarity across both periods was measured using cosine similarity, with the highest similarity score of 0.663 observed between Topic 4 in the pre-inauguration period and Topic 1 in the post-inauguration period. The findings provide insights into evolving media discourse and policy-related topic trends across the two periods, demonstrating the potentials of topic modeling in analyzing large-scale unstructured news data for diverse purposes to bridge computational science and empirical evidence of social science.
Downloads
Article Details

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
References
A. Dharma and N. Hendri, “Urgensi Penggunaan Big Data Analytics dalam Audit Sektor Publik,” vol. 18, pp. 107–120, 2022.
H. Hassani, C. Beneki, S. Unger, M. T. Mazinani, and M. R. Yeganegi, “Text mining in big data analytics,” Big Data Cogn. Comput., vol. 4, no. 1, pp. 1–34, 2020.
S. Bagaskoro, I. Amrozi, and A. R. Ramadhan, “Political Sentiment Analysis,” J. PolGov, vol. 4, no. 2, pp. 189–230, 2022.
S. Daud, M. Ullah, A. Rehman, T. Saba, R. Damaševičius, and A. Sattar, “Topic Classification of Online News Articles Using Optimized Machine Learning Models,” Computers, vol. 12, no. 1, 2023.
N. Newman, A. R. Arguedas, C. T. Robertson, R. K. Nielsen, and R. Fletcher, “Reuters Institute Digital News Report 2025.” 2025.
A. F. Hidayatullah, M. R. Ma’arif, M. Habibie, and S. Khomsah, “Indonesia Infrastructure Development Topic Discovery on Online News with Latent Dirichlet Allocation,” in IOP Conference Series: Materials Science and Engineering, 2021, vol. 1077, no. 1, p. 12012.
D. Antons, E. Grünwald, P. Cichy, and T. O. Salge, “The application of text mining methods in innovation research: current state, evolution patterns, and development priorities,” R&D Manag., pp. 329–351, 2020.
P. Ghasiya and K. Okamura, “Investigating COVID-19 News across Four Nations: A Topic Modeling and Sentiment Analysis Approach,” IEEE Access, vol. 9, pp. 36645–36656, 2021.
E. M. S. Saleh, N. A. Rahman, and D. H. Santoso, How Trends Shape the Media Landscape. Universiti Pendidikan Sultan Idris.
M. A. Ramdhani, M. A. Ramdhani, D. S. A. Maylawati, and T. Mantoro, “Indonesian news classification using convolutional neural network,” Indones. J. Electr. Eng. Comput. Sci., vol. 19, no. 2, pp. 1000–1009, 2020.
H. S. Maghdid, “Web News Mining Using New Features: A Comparative Study,” IEEE Access, vol. 7, pp. 5626–5641, 2019.
L. L. Wang and K. Lo, “Text mining approaches for dealing with the rapidly expanding literature on COVID-19,” Brief. Bioinform., 2021.
R. Churchill and L. Singh, “The Evolution of Topic Modeling,” ACM Comput. Surv., vol. 54, no. 10, 2022.
W. H. Rochmawati and others, “Analisis Persepsi Masyarakat Terhadap Komunikasi Kebijakan Menggunakan Topic Modelling,” 2022.
R. Raditya, “Perumusan Isu Strategis Kebijakan Investasi pada Sektor Pariwisata di Indonesia Berbasis Berita Media Online,” Multiverse Open Multidiscip. J., vol. 2, no. 3, pp. 346–362, 2023.
S. I. Ishak, O. Arnilia, T. Widodo, and I. G. N. A. B. Tatwa, “Analisis Sentimen terhadap Pemerintahan Prabowo--Gibran menggunakan IndoBERT dan LDA,” Jambura J. Informatics, vol. 1, no. 2, pp. 72–82, 2025.
W. Wahyuni, T. P. Lestari, M. Apriliana, and R. Gumelta, “Implementation of BERTopic for Topic Modeling Analysis of the Free Nutritious Meal Program Based on YouTube Comments,” 2025.
C. Suhaeni, L. Nissa, A. Mualifah, and H. Wijayanto, “LDA Topic Modeling Analysis of Public Discourse on Indonesia’s Free Nutritious Meals Program (MBG),” Int. J. Informatics Dev., vol. 14, no. 1, pp. 587–600, 2025.
D. T. Attaulah and D. Soyusiawaty, “Analisis Sentimen Program Makan Siang Gratis di Twitter/X menggunakan Metode BI-LSTM,” Edumatic J. Pendidik. Inform., vol. 9, no. 1, pp. 294–303, 2025.
D. Ayu, P. Wulandari, G. W. Wardhana, I. D. G. Puja, and W. Sindu, “Sentiment Analysis of the Free Lunch Program by Prabowo-Gibran Using the Naive Bayes Classifier,” Sist. Kendali & Jar., vol. 4, 2025.
A. Farkhod, A. Abdusalomov, F. Makhmudov, and Y. I. Cho, “LDA-based topic modeling sentiment analysis using topic/document/sentence (TDS) model,” Appl. Sci., vol. 11, no. 23, 2021.
R. Egger and J. Yu, “A Topic Modeling Comparison Between LDA, NMF, Top2Vec, and BERTopic to Demystify Twitter Posts,” Front. Sociol., vol. 7, 2022.
C. Meaney and others, “Non-negative matrix factorization temporal topic models and clinical text data identify COVID-19 pandemic effects,” J. Biomed. Inform., vol. 128, 2022.
R. Debnath and R. Bardhan, “India nudges to contain COVID-19 pandemic: A reactive public policy analysis using machine-learning based topic modelling,” PLoS One, vol. 15, no. 9, 2020.
E. Mayor and A. Miani, “A topic models analysis of the news coverage of the Omicron variant in the United Kingdom press,” BMC Public Health, vol. 23, no. 1, 2023.
G. Wiedemann, The World of Topic Modeling in R. Nomos, 2022.
D. Gunawan, C. A. Sembiring, and M. A. Budiman, “The Implementation of Cosine Similarity to Calculate Text Relevance between Two Documents,” in Journal of Physics: Conference Series, 2018, vol. 978, no. 1, p. 12120.