Explanatory Data Analysis to Evaluate Keyword Searches for Educational Videos on YouTube with a Machine Learning Approach


  • Mambang Mambang Universitas Sari Mulia, Banjarmasin
  • Ahmad Hidayat Universitas Sari Mulia, Banjarmasin
  • Johan Wahyudi STMIK Indonesia Banjarmasin
  • Finki Dona Marleny Universitas Muhammadiyah Banjarmasin




Explanatory Data Analysis, Evaluating, Educational Videos, YouTube, Machine Learning


One of the most important parts of data science is the process of explanatory data analysis. This study aims to analyze learning videos on YouTube using search keywords such as learning biology, chemistry, physics, computers, mathematics, management, accounting, citizenship, history, and culture. The method used is the explanatory data analysis technique with a Machine Learning approach. The dataset used in this study uses learning video search keywords found on the YouTube digital platform. After doing a thorough analysis of all existing variables, we found that in the context of searching for learning video keywords on YouTube, the viewing variable has a heatmap correlation of 0.97 on the likes variable, 0.97 on the subscribers variable, -0.15 on the duration variable and 0.95 on the comment variable. The duration variable negatively correlates with all variables based on the analysis using a correlation heatmap using the seaborn library. Our analysis found that the number of learning videos with the search keyword Mathematics had the highest number of views among other variables. Further research can use existing variables or also add variables and add search keywords on YouTube. The data analysis approach can also be done using SPSS, R and also a Machine Learning approach with different libraries.

GS Cited Analysis


Download data is not yet available.


Ahmed, Alaa H., Mokhaled N. A. Al-hamadani, and Ihab A. Satam. 2022. “Prediction of COVID-19 Disease Severity Using Machine Learning Techniques.” Bulletin of Electrical Engineering and Informatics 11(2):1069–74. doi: 10.11591/eei.v11i2.3272.

Al-zaman, Sayeed. 2022. “Social Mediatization of Religion : Islamic Videos on YouTube.” Heliyon 8(February):e09083. doi: 10.1016/j.heliyon.2022.e09083.

Davazdahemami, Behrooz, Hamed M. Zolbanin, and Dursun Delen. 2022. “An Explanatory Analytics Framework for Early Detection of Chronic Risk Factors in Pandemics.” Healthcare Analytics 2(January):100020. doi: 10.1016/j.health.2022.100020.

Elareshi, Mokhtar, Mohammed Habes, Enaam Youssef, Said A. Salloum, and Raghad Alfaisal. 2022. “SEM-ANN-Based Approach to Understanding Students ’ Academic-Performance Adoption of YouTube for Learning during Covid.” Heliyon 8(May 2021):e09236. doi: 10.1016/j.heliyon.2022.e09236.

Foster, Brian K., William Mack Malarkey, Timothy C. Maurer, Daniela F. Barreto Rocha, Idorenyin F. Udoeyo, and Louis C. Grandizio. 2022. “Biceps Tendon Rupture Videos on YouTube : An Analysis of Video Content and Quality.” Journal of Hand Surgery Global Online 4(1):3–7. doi: 10.1016/j.jhsg.2021.10.009.

Kim, Taemin, and Soobum Lee. 2021. “Predictors of Viewing YouTube Videos on Incheon Chinatown Tourism in South Korea : Engagement and Network Structure Factors.” Sustainability 13:1–11. doi: 10.3390/su132212534.

Lawson, Alyssa P., and Richard E. Mayer. 2022. “Does the Emotional Stance of Human and Virtual Instructors in Instructional Videos Affect Learning Processes and Outcomes ?” Contemporary Educational Psychology 70(May):102080. doi: 10.1016/j.cedpsych.2022.102080.

Minn, Sein. 2022. “AI-Assisted Knowledge Assessment Techniques for Adaptive Learning Environments.” Computers and Education: Artificial Intelligence 3(July 2021):100050.

Mohammadhassan, Negar, Antonija Mitrovic, and Kourosh Neshatian. 2022. “Investigating the Effect of Nudges for Improving Comment Quality in Active Video Watching.” Computers & Education 176(September 2021):104340. doi: 10.1016/j.compedu.2021.104340.

Nisa, Meher U. N., Danish Mahmood, Ghufran Ahmed, Suleman Khan, and Mazin Abed Mohammed. 2021. “Optimizing Prediction of YouTube Video Popularity Using XGBoost.” Electronics Article 10:1–16. doi: 10.3390/electronics10232962.

Ramadhani, Atik, Zenobia Zettira, Yuanita Lely Rachmawati, and Ninuk Hariyani. 2021. “Quality and Reliability of Halitosis Videos on YouTube as a Source of Information.” Dentistry Journal Article 9(10):1–9. doi: 10.3390/dj9100120.

Shi, Hui, Dong Yang, Kaichen Tang, Chunmei Hu, Lijuan Li, and Linfang Zhang. 2022. “Explainable Machine Learning Model for Predicting the Occurrence of Postoperative Malnutrition in Children with Congenital Heart Disease.” Clinical Nutrition 41(1):202–10. doi: 10.1016/j.clnu.2021.11.006.

Yurdaisik, Isil. 2020. “Analysis of the Most Viewed First 50 Videos on YouTube about Breast Cancer.” BioMed Research International 2020:1–7. doi: 10.1155/2020/2750148.

Zhao, Pengxiang, He Haitao, Aoyong Li, and Ali Mansourian. 2021. “Impact of Data Processing on Deriving Micro-Mobility Patterns from Vehicle Availability Data.” Transportation Research Part D 97(June):102913. doi: 10.1016/j.trd.2021.102913.

Zhao, Yanmin, and Yang You. 2021. “Design and Data Analysis of Wearable Sports Posture Measurement System Based on Internet of Things.” Alexandria Engineering Journal 60(1):691–701. doi: 10.1016/j.aej.2020.10.001.


Crossmark Updates

How to Cite

Mambang, M., Hidayat, A. ., Wahyudi, J. ., & Marleny, F. D. . (2022). Explanatory Data Analysis to Evaluate Keyword Searches for Educational Videos on YouTube with a Machine Learning Approach. Sinkron : Jurnal Dan Penelitian Teknik Informatika, 7(3), 915-922. https://doi.org/10.33395/sinkron.v7i3.11502