A docker based crawler Sections: + How To Run + Explanation + Troubleshoot
- Install Docker Desktop using the steps in: Docker
- Clone this repository wherever you like: https://github.com/rxwp5657/DockerCrawler.git
- Change to the project directory:
cd DockerCrawler
- Type the commands:
- The previous output means that three crawlers where made and that are running already...
- If everything is OK, you should see the following on the files
- The
docker-compose.yml
is in charge of making multiple crawlers - Each crawler process a subset of the words, this is:
- Crawler process from A to H
- Crawler2 process from H to P
- Crawler3 process from P to Z
- All of the crawlers run the same code described on the file
./ExtraCrawler/extra.py
- Each of the crawlers results are saved on their respective folder and, especifically, the file
english_words.csv
If there is a problem with the docker commands, check: