Skip to content

A web scraper program to extract university data and publication scores on the SINTA website using Scrapy

License

Notifications You must be signed in to change notification settings

crypter70/SINTA-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SINTA-Scraper

Overview

The program aimed to extract university data and publication scores from the SINTA website using Scrapy. The targeted website is static, and the data is not loaded using JavaScript, which makes Scrapy an appropriate choice due to its efficiency and speed in handling static data on websites. The extracted data included relevant data such as the university's name, location, and publication scores. The data was saved in CSV format for further processing and analysis.

URL: https://sinta.kemdikbud.go.id/affiliations

Website: image

Official Docs

Scrapy Documentation https://scrapy.org/

Installation

pip install scrapy
pip3 install scrapy

Run Program

scrapy runspider scraper.py

Export Data

csv

scrapy runspider scraper.py -o data.csv

json

scrapy runspider scraper.py -o data.json

About

A web scraper program to extract university data and publication scores on the SINTA website using Scrapy

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages