Skip to content

A web scraper program to extract product data on the Tokopedia marketplace website based on keywords using Selenium

License

Notifications You must be signed in to change notification settings

crypter70/Tokopedia-Scraper

Repository files navigation

Tokopedia-Scraper

Overview

The program aimed to extract product data from the Tokopedia marketplace website based on specified keywords using web scraping techniques. Selenium with JavaScript-enabled selectors was utilized to extract the data due to the dynamic elements on the website. The extracted data included product name, price, location, rating, number of items sold, and details link, which were essential for data analysis and market research. The data was saved in both CSV and JSON formats for further processing and analysis.

URL: https://www.tokopedia.com/

Website: Screenshot 2023-02-09 at 21 49 34

Prerequisite

  • Python 3
  • WebDriver

Official Docs

Selenium Documentation: https://www.selenium.dev/documentation/

Installation

Installing Selenium

pip install selenium
pip3 install selenium

Installing tqdm

pip install tqdm
pip3 install tqdm

Disclaimer

This code is intended for educational purposes. Please respect privacy, copyright and code and data terms of use. Read more about Tokopedia here.

Rules

  1. Do not retrieve unnecessary data
  2. Do not use the collected data for illegal purposes
  3. Maintain the confidentiality of the data collected
  4. Documenting data scraping times

Website Version

Last scraping date: 9 February 2023

About

A web scraper program to extract product data on the Tokopedia marketplace website based on keywords using Selenium

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages