Social Media (Twitter) Based Movie Recommendation System On Disney+ With Hybrid Filtering Using Neighboor's K-Nearest Method
Viewed = 0 time(s)
Abstract
The research aims on the development of a film recommendation system that combines the Hybrid Filtering and K-Nearest Neighbors method. (KNN). Hybrid Filtering combines a variety of recommendation techniques, including collaborative filtering and content-based filtering, while KNN is a simple method for classifying data based on similarities with close neighbors. The data set used is Kaggle’s Rotten Tomatoes, which includes information such as critics’ names, genres, movie titles, and review content. The aim of the study was to build an accurate system of recommendations based on user ratings on Disney+ Hotstar and measure its performance using MAE (Mean Absolute Error) and Confusion Matrix assessments.The results showed that the combination of Hybrid Filtering and KNN methods resulted in better accuracy values in giving film recommendations compared to using only the Collaborative Filtering method. Graphics and performance analysis show that the developed models are able to provide film recommendations with increasing accuracy over time. In conclusion, the combination of Hybrid Filtering and K-Nearest Neighbors methods is effective in improving the accuracy of the movie recommendation system, helping users choose films that match their preferences on the Disney+ Hotstar platform. This research contributes to the development of better and more accurate recommendation systems in the film industry.
References
Ali, E. (2020). Geographic Information System ( GIS ): Definition , Development , Applications & Components. Academia, 79.
Bairagi, Vinayak; Munot, M. V, JE, P. C., Sultan, M. T., Selvan, C. P., Irulappasamy, S., Mustapha, F., Basri, A. A., Safri, S. N. A. N. A., Taamneh, S., Tsiamyrtzis, P., Dcosta, M., Buddharaju, P., Khatri, A., Manser, M., Ferris, T., Wunderlich, R., Pavlidis, I., Rastgoo, M. N., Wang, X. X., … Nichele, S. (2019). Research Methodology A Practical and Scientific Approach. In Transportation Research Part F: Traffic Psychology and Behaviour (Vol. 4, Issue March 2019).
Bisong, E. (2019). Google Colaboratory. In Building Machine Learning and Deep Learning Models on Google Cloud Platform. https://doi.org/10.1007/978-1-4842-4470-8_7
Chen, K., Duan, Z., & Yang, S. (2022). Twitter as research data. Politics and the Life Sciences, 41(1). https://doi.org/10.1017/pls.2021.19
De Groote, P. (2011). Globalisation of commercial theme parks case: The Walt Disney Company. Applied Studies in Agribusiness and Commerce, 5(3–4). https://doi.org/10.19041/apstract/2011/3-4/2
Findawati, Y., Astutik, I. R. I., Fitroni, A. S., Indrawati, I., & Yuniasih, N. (2019). Comparative analysis of Naïve Bayes, K Nearest Neighbor and C.45 method in weather forecast. Journal of Physics: Conference Series, 1402(6). https://doi.org/10.1088/1742-6596/1402/6/066046
Gupta, A., Kumar, L., Jain, R., & Nagrath, P. (2020). Proceedings of First International Conference on Computing, Communications, and Cyber-Security (IC4S 2019). In Lecture Notes in Networks and Systems (Vol. 121, Issue Ic4s).
Kadhim, A. I. (2018). An Evaluation of Preprocessing Techniques for Text Classification. International Journal of Computer Science and Information Security, 16(6), 22–32. https://sites.google.com/site/ijcsis/
Kastouni, M. Z., & Ait Lahcen, A. (2022). Big data analytics in telecommunications: Governance, architecture and use cases. In Journal of King Saud University - Computer and Information Sciences (Vol. 34, Issue 6). https://doi.org/10.1016/j.jksuci.2020.11.024
Lilleberg, J., Zhu, Y., & Zhang, Y. (2015). Support vector machines and Word2vec for text classification with semantic features. Proceedings of 2015 IEEE 14th International Conference on Cognitive Informatics and Cognitive Computing, ICCI*CC 2015, 136–140. https://doi.org/10.1109/ICCI-CC.2015.7259377
Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2006). Distributed Representations of Words and Phrases and their Compositionality. Neural Information Processing Systems, 1, 1–9.
Mohan Kumar, A. V., Suhas, M., & Fedrich, N. (2022). Sentiment Analysis on Twitter Data. In Cognitive Science and Technology. https://doi.org/10.1007/978-981-19-2350-0_43
Paper, D. (2021). Build Your First Neural Network with Google Colab. In TensorFlow 2.x in the Colaboratory Cloud. https://doi.org/10.1007/978-1-4842-6649-6_2
Pisarenko, V. F., & Pisarenko, D. V. (2022). A Modified k-Nearest-Neighbors Method and Its Application to Estimation of Seismic Intensity. Pure and Applied Geophysics, 179(11). https://doi.org/10.1007/s00024-021-02717-y
Prasetya, C. S. D. (2017). Sistem Rekomendasi Pada E-Commerce Menggunakan K-Nearest Neighbor. Jurnal Teknologi Informasi Dan Ilmu Komputer, 4(3). https://doi.org/10.25126/jtiik.201743392
Purbolaksono, M. D., & Suryani, A. A. (n.d.). Skip-Gram Negative Sample for Word Embedding in Indonesian-Translation Text Classification using Backpropagation Classifier. x, 1–9.
Shakya, S., & Dave, M. (2022). Analysis, Detection, and Classification of Android Malware using System Calls. https://arxiv.org/abs/2208.06130v1%0Ahttps://arxiv.org/ftp/arxiv/papers/2208/2208.06130.pdf
Wang, X., Dai, Z., Li, H., & Yang, J. (2021). Research on Hybrid Collaborative Filtering Recommendation Algorithm Based on the Time Effect and Sentiment Analysis. Complexity, 2021. https://doi.org/10.1155/2021/6635202
Wang, X., Jiang, W., & Luo, Z. (2016). Combination of convolutional and recurrent neural network for sentiment analysis of short texts. COLING 2016 - 26th International Conference on Computational Linguistics, Proceedings of COLING 2016: Technical Papers, 2428–2437.
Xu, Y. (2022). Introduction and Development Strategy of Walt Disney Company. Highlights in Business, Economics and Management, 1. https://doi.org/10.54097/hbem.v1i.2529
Copyright (c) 2023 Azrina Fazira Ansshory, Erwin Budi Setiawan
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.