Implementation of Web Scraping for Journal Data Collection on the SINTA Website
DOI:
10.33395/sinkron.v7i4.11576Abstract
SINTA is a website portal pioneered by the Director General of R&D Improvement, Ministry of Research Technology, and Higher Education, Republic of Indonesia to make it easier for researchers to search for journals for publication. However, in its implementation, many researchers have encountered problems, one of which is in the form of searching for publication duration and ranking, as well as searches that are carried out manually, making it difficult for researchers to find a place to publish. In this study, the author took journal data based on the SINTA website with Web scraping techniques using the Python programming language and then stored the data in the SINTA database, then scheduling a Cron job so that the data in the database was always updated. It is hoped that the results of this study can help researchers in searching for journals for publication.
Downloads
References
Al hadi, I. F., Chusna, C., Ilham, S., & Fauzan, A. C. (2019). Implementasi Penjadwalan Round Robin pada Task Scheduler untuk Pembaruan Aplikasi Otomatis. ILKOMNIKA: Journal of Computer Science and Applied Informatics, 1(1), 11–14. https://doi.org/10.28926/ilkomnika.v1i1.7
Arumi, E. R., & Sukmasetya, P. (2020). Exploiting Web Scraping for Education News Analysis Using Depth-First Search Algorithm. JOIN (Jurnal Online Informatika ), 5(1), 19–26. https://doi.org/10.15575/join.v5i1.548
Christanto, F. W., & Rudiyanto. (2020). Cron Job Technique pada Integrasi WLAN Controller Device dan Google Maps API Berbasis Website dalam Jaringan Indonesia Wifi. Matrix : Jurnal Manajemen Teknologi Dan Informatika, 10(2), 50–57. https://doi.org/10.31940/matrix.v10i2.1477
Josi, A., & Andretti Abdillah, L. (n.d.). PENERAPAN TEKNIK WEB SCRAPING PADA MESIN PENCARI ARTIKEL ILMIAH.
Lukman, Ahmadi, S. S., Manalu, W., & Hidayat, D. S. (2019). Publikasi. Kementerian Riset, Teknologi, Dan Pendidikan Tinggi, 1–214. http://risbang.ristekdikti.go.id
Mitchell, R. (2018). Web Scraping with Python and BeautifulSoup. O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 9.
Mufidah, U. (2021). Perancangan Aplikasi Perbandingan Harga Produk (Historical Data) Menggunakan Teknik Web Scraping. Skripsi, 1(1), 1–14.
Priyanto, A., & Ma’arif, M. R. (2018). Implementasi Web Scrapping dan Text Mining untuk Akuisisi dan Kategorisasi Informasi dari Internet (Studi Kasus: Tutorial Hidroponik). Indonesian Journal of Information Systems, 1(1), 25–33. https://doi.org/10.24002/ijis.v1i1.1664
Purnomo, L. M., & Ayub, M. (2021). Analisis data hasil web scraping untuk menentukan kualitas jurnal ilmiah. Jurnal STRATEGI-Jurnal Maranatha, 3(1), 122–132. http://strategi.it.maranatha.edu/index.php/strategi/article/view/237
Rahmatulloh, A., & Gunawan, R. (2020). Web Scraping with HTML DOM Method for Data Collection of Scientific Articles from Google Scholar. Indonesian Journal of Information Systems, 2(2), 95–104. https://doi.org/10.24002/ijis.v2i2.3029
Sahria, Y. (2020). Implementasi Teknik Web Scraping pada Jurnal SINTA Untuk Analisis Topik Penelitian Kesehatan Indonesia. In URECOL (Unversity Research Colloqium). http://repository.urecol.org/index.php/proceeding/article/view/1079
Saputra, A. (2020). Pemanfaatan Science and Technology Index (SINTA) untuk Publikasi Karya Ilmiah dan Pencarian Jurnal Nasional Terakreditasi.
Sembiring, F., & Erfina, A. (2020). Bahasa Ular untuk Pemrograman Python (R. Aminah (ed.)). Insan Cendekia Mandiri.
Sembiring, F., Fergina, A., Saepudin, S., Erfina, A., & Gustian, D. (2020). Fundamental_Basis_Data. Media Sains Indonesia.
Sembiring, F., & Sari, D. P. (2019). INTEGRATED (Information Tecknology and Vocational Education) Volume Design Process Data Storage and Organize Dat... Design Process Data Storage and Organize Data Scraping. 1(1), 22–26.
Uzun, E., Yerlikaya, T., & Kırat, O. (2018). Comparison of Python Libraries used for Web Data Extraction. Journal of the Technical University - Sofia Plovdiv Branch, Bulgaria, 24(May), 87–92. https://erdincuzun.com/wp-content/uploads/download/plovdiv_journal_2018_01.pdf
Wirawan, A. (2020). Sistem Scheduling Pelaporan Data Akademik di UIN Sunan Kalijaga ke Pangkalan Data Pendidikan Tinggi (PDDikti) dengan Menggunakan Fitur Cron Job di Linux. JISKA (Jurnal Informatika Sunan Kalijaga), 5(3), 177–184. https://doi.org/10.14421/jiska.2020.53-05
Yanti, G., Z, Z., & Megasari, S. W. (2020). Pelatihan Penulisan Artikel untuk Publikasi E-Jurnal bagi Researcher Club. Dinamisia : Jurnal Pengabdian Kepada Masyarakat, 4(3), 461–469. https://doi.org/10.31849/dinamisia.v4i3.4107
Downloads
How to Cite
Issue
Section
License
Copyright (c) 2022 Nelawati Adila
![Creative Commons License](http://i.creativecommons.org/l/by-nc/4.0/88x31.png)
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.