simple_image_scraper

web scraping for downloading a number of images from web

This script is made for scaping a number of images from certain web site.

I made this repo so that i can refer this when i code something for web scraping next time.

If you want to use this script, you most likely have to alter some parts of this script based on a web site you use at. Using deveopper mode of browsers would be helpful.

Hope you can get something from this piece of codes too !

what does this code do?

Read urls from urls.txt
Open each url with brave browser using selenium and get html
Get <script> tag which contains JSON using BeautifulSoup4
Extract some information from JSON
Create a new directory and ownload images from server by running curl command on shell, and save them on the directory

Modules

python3 -m pip install -r requirements.txt

By doing this, you can import beautifulsoup4 and selenium.

Notes

You need to list target urls in the urls.txt
If something happens and code stops, urls which is not downloaded yet will be output to the failed_urls.txt. Next time you run the code, urls in the failed_urls.txt will be added to the urls.txt automatically.
Selenium manipulates Brave browser on this code. You can use Chrome as well.
You have to download chromedriver.exe from web. You can put it in your working directory if you want.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
README.md		README.md
failed_urls.txt		failed_urls.txt
requirements.txt		requirements.txt
scraper.py		scraper.py
tags.txt		tags.txt
urls.txt		urls.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

simple_image_scraper

what does this code do?

Modules

Notes

About

Releases

Packages

Languages

Zetsu4i/simple_image_scraper

Folders and files

Latest commit

History

Repository files navigation

simple_image_scraper

what does this code do?

Modules

Notes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages