- Python selenium script developed to scrape TNEB website for electricity consumption charges.
- Scheduled to run every Sunday at 00:00 (refer to .github/workflows/schedule-scraper.yml).
- Utilizes
Tesseract
for bypassing the captcha validation during the scraping process. - Implements automated retries in case of scraping failures due to captcha or other issues.
- Sends scraping results (success/failure) via email after each job run.
- Clone the github repository
- Ensure you have
docker
installed on your machine - Create a .env file. Clone the sample.env file and customize it according to your needs.
- Create an eb-mapping.json file containing mappings for the Consumer No. You can use the sample-mapping.json file as a reference.
- Run command
docker compose up
(ordocker-compose up
)