Skip to content

A web scraper program to extract data on the top 1000 schools based on UTBK scores in 2022 using Scrapy

License

Notifications You must be signed in to change notification settings

crypter70/LTMPT-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LTMPT-Scraper

Overview

The program aimed to extract data on the top 1000 schools based on UTBK scores in 2022 using Scrapy. The targeted website was a static one, and the data was not loaded using JavaScript. Therefore, Scrapy was an appropriate choice due to its efficiency and speed in handling static data on websites. The program extracted relevant data such as the schools' names, locations, UTBK scores, and other relevant information. The extracted data was useful for analyzing and evaluating the schools' academic performance and ranking. The data was saved in CSV format for further processing and analysis.

URL: https://top-1000-sekolah.ltmpt.ac.id

Website: image

Official Docs

Scrapy Documentation https://scrapy.org/

Installation

pip install scrapy
pip3 install scrapy

Run Program

scrapy runspider scraper.py

Export Data

csv

scrapy runspider scraper.py -o data.csv

json

scrapy runspider scraper.py -o data.json

About

A web scraper program to extract data on the top 1000 schools based on UTBK scores in 2022 using Scrapy

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages