Movie Rating Prediction Analysis on Netflix Using Support Vector Machine Algorithm
Keywords:
Data Mining, Netflix, Rating Prediction, RapidMiner, Support Vector MachineAbstract
The rapid rise of digital streaming platforms such as Netflix has created a demand for accurate movie rating prediction systems to enhance user experience and content personalization. This study investigates the performance of the Support Vector Machine (SVM) algorithm in classifying movie ratings using historical metadata, including genre, duration, release year, and country of production. A quantitative approach is employed using RapidMiner for data mining and model evaluation. The dataset, sourced from publicly available open-access repositories, underwent preprocessing steps such as normalization, transformation of categorical attributes, and data splitting. Experimental results show that the SVM model achieved an accuracy of 83.10%. Furthermore, positive precision reached 82.98% and recall reached 100%, yielding a positive F1-score of 90.61%. However, the model exhibited limitations in detecting negative ratings, as indicated by a negative recall of only 3.98% despite achieving 100% negative precision, resulting in a low negative F1-score of 7.65%. These findings suggest that while SVM performs well in identifying positive instances, further improvement is needed for balanced classification in real-world movie rating prediction systems.
References
Arifin, N., Enri, U., & Sulistiyowati, N. (2021). Application of Support Vector Machine (SVM) algorithm with TF-IDF n-gram for text classification. STRING (Unit of Research Writing and Technological Innovation), 6(2), 129. https://doi.org/10.30998/string.v6i2.10133
Farid Naufal, M. (2021). Comparison and analysis of SVM algorithm and CNN. Journal of Information Technology and Computer Science, 8(2), 311–318. https://doi.org/10.25126/jtiik.202184553
Hanafi, H. (2023). Data cleaning in big data: Review. [Unpublished manuscript], 1–5.
Handayani, A., & Zufria, I. (2023). Sentiment analysis of Indonesia 2024 presidential candidates on Twitter using SVM algorithm. Journal of Information System Research (JOSH), 5(1), 53–63. https://doi.org/10.47065/josh.v5i1.4379
Harahap, N., & Nasution, L. (2022). The effect of employee work professionalism on public service quality at the Medan City BPS Office. Student Business Journal, 2(1), 43–52.
Hodson, T. O. (2022). Root-mean-square error (RMSE) or mean absolute error (MAE): When to use them or not. Geoscientific Model Development, 15(14), 5481–5487. https://doi.org/10.5194/gmd-15-5481-2022
Huda, A. S., Nafsika, S. S., & Salman, S. (2023). Film as a media in changing the way of human view in humanitarian principles. Irama: Journal of Art Design and Learning, 5(1), 9–14.
Husada, H. C., & Paramita, A. S. (2021). Sentiment analysis on airlines on the Twitter platform using the Support Vector Machine (SVM) algorithm. Teknika, 10(1), 18–26. https://doi.org/10.34148/teknika.v10i1.311
Ilmi, R. R., Kurniawan, F., & Harini, S. (2023). IMDb film rating prediction using decision tree. Journal of Information Technology and Computer Science, 10(4), 791–798. https://doi.org/10.25126/jtiik.2023106615
Mahendrajaya, R., Buntoro, G. A., & Setyawan, M. B. (2020). Sentiment analysis of GoPay users using lexicon-based and Support Vector Machine methods. Computer, 3(2), 52. https://doi.org/10.24269/jkt.v3i2.270
Mahendro, I., & Abimanto, D. (2022). Analysis of student satisfaction with e-learning using Support Vector Machine algorithm. Journal of Maritime Science and Technology, 23(1), 97. https://doi.org/10.33556/jstm.v23i1.333
Mujilahwati, S., Nawafilah, N. Q., & Aliyudin, M. (2020). Analysis of prediction results with the PROMETHEE method. Mnemonic Journal, 2(1), 35–40. https://doi.org/10.36040/mnemonic.v2i1.49
Octaviani, P. A., Wilandari, Y., & Ispiriyanti, D. (2020). Application of SVM method on elementary school accreditation data in Magelang Regency. Gaussian Journal, 3(8), 811–820.
Priskilla, R., & Suparni, S. (2024). Analysis of public sentiment towards Netflix online movie streaming application with Naïve Bayes method. JATI (Journal of Informatics Engineering Students), 8(2), 1339–1342. https://doi.org/10.36040/jati.v8i2.7520
Rafi Nahjan, M., Heryana, N., & Voutama, A. (2023). Implementation of RapidMiner with K-Means clustering method for sales analysis at Oj Cell Store. JATI (Journal of Informatics Engineering Students), 7(1), 101–104. https://doi.org/10.36040/jati.v7i1.6094
Rahmadiani, P., & Kusrini, E. (2023). Operator performance analysis using overall labor effectiveness method with root cause analysis approach. Asian Journal of Social and Humanities, 1(11), 918–927. https://doi.org/10.59888/ajosh.v1i11.106
Romadloni, P., Kusuma, B. A., & Baihaqi, W. M. (2022). Comparison of machine learning methods for implementation of decision making in determining employee promotion. JATI (Journal of Informatics Engineering Students), 6(2), 622–628. https://doi.org/10.36040/jati.v6i2.5238
Sari, N. P. P. A., Suryawati, I. G. A. A. S., & Pradipta, A. D. (2021). Motives and satisfaction of Netflix users as streaming media among teenagers in Denpasar City. Journal of Communication Science MEDIUM, 1(1), 1.
Siti Aisah, I., Irawan, B., & Suprapti, T. (2024). Support Vector Machine (SVM) algorithm for sentiment analysis of digital Qur'an application reviews. JATI (Journal of Informatics Engineering Students), 7(6), 3759–3765. https://doi.org/10.36040/jati.v7i6.8263
Widiari, N. P. A., Suarjaya, I. M. A. D., & Githa, D. P. (2020). Data cleaning technique using Snowflake for case study of tourism objects in Bali. Merpati Scientific Journal (Information Technology Academics Research Tower), 8(2), 137. https://doi.org/10.24843/jim.2020.v08.i02.p07
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Nanda Syamia, Ayu Pratiwi

This work is licensed under a Creative Commons Attribution 4.0 International License.