csse-uoft/website-scraper

Install Dependencies

conda env create -f PyWebScraper.yml

Run

conda activate PyWebScraper
python main.py -u http://domain.com

Parameters

  • -u : Starting URL to parse (e.g. http://main.com).
  • -m : Selector used to locate the main content (e.g. 'div.main', 'div[id="main"]').
  • -n : Selector used to locate site navigation links (e.g. 'div.nav a').
  • -js : Whether to run JavaScript on the page (0 = no, 1 = yes; default is 1).
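
The flags above could be parsed along the following lines. This is only a minimal sketch using Python's standard argparse module, not the actual contents of main.py: the function name build_parser and the destination names url, main_selector, nav_selector and run_js are assumptions made for illustration, as is treating -u as required and the -m and -n expressions as optional.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the four documented flags (hypothetical layout)."""
    parser = argparse.ArgumentParser(description="Website scraper (sketch)")
    # Starting URL; assumed to be required because every example passes it.
    parser.add_argument("-u", dest="url", required=True,
                        help="Starting URL to parse")
    # Expressions for main content and navigation links; assumed optional.
    parser.add_argument("-m", dest="main_selector", default=None,
                        help="Where to look for main content")
    parser.add_argument("-n", dest="nav_selector", default=None,
                        help="Where to look for site navigation links")
    # 0 or 1, defaulting to 1 as stated in the parameter list.
    parser.add_argument("-js", dest="run_js", type=int, choices=[0, 1],
                        default=1, help="Run JavaScript on the page (0 or 1)")
    return parser


if __name__ == "__main__":
    # Example invocation with an explicit argument list instead of sys.argv.
    args = build_parser().parse_args(
        ["-u", "http://domain.com", "-m", "div.main", "-js", "0"]
    )
    print(args.url, args.main_selector, args.nav_selector, bool(args.run_js))
```

Restricting -js to the values 0 and 1 makes argparse reject anything else with a usage error, which matches the two values documented above.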
