Mapping of Underdeveloped Areas Based On Research Frequency Utilizing Distributed Web Scraping and Web GIS
Keywords:
Web Scraping, Web GIS, Distributed System
Abstract
Indonesia consists of many geographically scattered areas, and one resulting challenge is the development of underdeveloped areas. Scientific research can serve as a reference for improving an area's development: the frequency with which an area is the object of research can be presented as maps and statistics. A histogram map helps in analyzing which areas and topics existing research does not cover. Data are collected with distributed parallel web scraping to speed up collection for 46,280 regions in Indonesia. The system is developed with the waterfall SDLC (Software Development Life Cycle) model, proceeding through requirements analysis, system design, development, testing, and maintenance. The results show that the distributed scraping process produces data faster than a single scraper. The scraped data are processed into maps and statistics that can help researchers identify and interpret underdeveloped areas in Indonesia.
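The distributed collection step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the chunking scheme, and the stubbed `count_publications` lookup are all assumptions made for illustration, and threads stand in for the separate scraper nodes a distributed deployment would use.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(items, n_workers):
    """Split items into n_workers near-equal, contiguous chunks."""
    size, extra = divmod(len(items), n_workers)
    chunks, start = [], 0
    for i in range(n_workers):
        end = start + size + (1 if i < extra else 0)
        chunks.append(items[start:end])
        start = end
    return chunks


def count_publications(region):
    """Hypothetical stand-in for one scrape request.

    A real worker would query a publication index for `region` and parse
    the hit count; a deterministic fake value keeps this sketch offline.
    """
    return len(region) % 5


def scrape_chunk(chunk):
    """Work done by a single scraper node: one lookup per region."""
    return {region: count_publications(region) for region in chunk}


def scrape_distributed(regions, n_workers=4):
    """Fan the chunks out to workers and merge their partial results."""
    frequency = {}
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        for partial in pool.map(scrape_chunk, partition(regions, n_workers)):
            frequency.update(partial)
    return frequency


def least_researched(frequency, k=3):
    """Return the k regions with the fewest publications (ties by name)."""
    return sorted(frequency, key=lambda r: (frequency[r], r))[:k]
```

Calling `scrape_distributed(regions, n_workers=8)` on a list of region names returns a dictionary of research counts per region, which `least_researched` can rank before the counts are drawn on a map. Any speed-up over a single scraper comes from overlapping the network waits of independent requests, so it would depend on the target site's rate limits.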
Published
Versions
- 2022-12-20 (2)
- 2022-12-20 (1)
License
Copyright (c) 2022 Fransyudha Abandhika, Alfian Nur Fathoni, Diah Priyawati

This work is licensed under a Creative Commons Attribution 4.0 International License.