This repository has been archived by the owner on Sep 11, 2022. It is now read-only.
Releases: danthespal/webScraper
Releases · danthespal/webScraper
webScraper v0.4 - alpha
- added requirements.txt file so that installing same libraries will be easier and updated README file
- corrected pep8 warnings
- added dynamic user-agents to settings file
- added additional info about a product
- added loop until last page to extract all elements
- addapt css to xpath, see more info in comments
- store items informations into sqlite database
- fix some problems in saving items
- increase column product_name string
- check for duplicate product_name
webScraper v0.3 - alpha
- download from multiple pages
webScraper v0.2 - alpha
- base for downloading items [crawler.py]
- little customization for product_name [items.py]
- add user agent (you need to install 'scrapy-user-agents 0.1.1') [settings.py]
- enabled download_delay and auto_throttle for better scraping [settings.py]
webScraper v0.1 - alpha
just the basics scrapy framework generated content