Scraper for the https://heatpumpmonitor.org/ website that pulls real heat pump monitoring data.
heatpumpmonitoring_scraping
│   README.md
│   LICENSE
│   src
│   │   api.py       (API for scraping the data)
│   │   scraping.py  (scraping functions for obtaining the API key of a specific heat pump)
│   │   utils.py     (utility functions for data handling)
│   │   fetch.py     (fetching functions for obtaining the data)
│   │   main.py      (main script for running the scraping)
│   requirements.txt
│   .gitignore
The scraping procedure is as follows:
- Extract the IDs of all the heat pumps listed on the website.
- For each heat pump, scrape its page to extract the API key and the URL of the MyHeatPump app (a sketch of this step follows the list).
- Use the API key and the URL to download the time series data from the API. Since there is a limit on the number of bytes that can be queried per request, the download is split by month and by groups of 5 variables (see the second sketch below).
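As a rough illustration of the API-key step, the snippet below fetches a heat pump's public page and looks for a MyHeatPump app link carrying a read key. The page URL pattern and the `readkey=` query parameter are assumptions made for this sketch; the actual logic lives in src/scraping.py.

```python
import re

import requests


def scrape_api_key(system_id: int):
    """Return (app_url, api_key) scraped from a heat pump's public page.

    The URL pattern and the `readkey=` query parameter are assumptions for
    this sketch; see src/scraping.py for the actual implementation.
    """
    page_url = f"https://heatpumpmonitor.org/system/view?id={system_id}"  # assumed URL pattern
    html = requests.get(page_url, timeout=30).text

    # Look for a MyHeatPump app link of the form .../app/view?...readkey=<hex key>
    match = re.search(r'href="([^"]*app/view[^"]*readkey=([0-9a-f]+)[^"]*)"', html)
    if match is None:
        return None, None  # this system exposes no public API key
    app_url, api_key = match.group(1), match.group(2)
    return app_url, api_key
```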
To run everything, use the main.py script. It scrapes the data for every heat pump that has an associated API key, saves the time series to a CSV file, and saves the metadata associated with each variable to a JSON file.
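For illustration, the sketch below shows the chunked download pattern described above: one request per month and per group of at most 5 variables. The host, endpoint, feed names, and parameters are assumptions for the example and may differ from what src/fetch.py actually does.

```python
from datetime import datetime, timedelta, timezone

import requests

# All names below (host, endpoint, feed names, parameters) are illustrative
# assumptions for this sketch; the real download logic lives in src/fetch.py.
BASE_URL = "https://emoncms.org"                     # assumed MyHeatPump host
API_KEY = "read_key_scraped_for_this_heat_pump"      # obtained by scraping.py
VARIABLES = ["heatpump_elec", "heatpump_heat", "heatpump_flowT",
             "heatpump_returnT", "heatpump_outsideT", "heatpump_roomT"]


def month_windows(start, end):
    """Yield (window_start, window_end) pairs covering [start, end) month by month."""
    current = start
    while current < end:
        first_of_next = (current.replace(day=1) + timedelta(days=32)).replace(day=1)
        yield current, min(first_of_next, end)
        current = first_of_next


def groups_of(items, size=5):
    """Split a list into groups of at most `size` elements."""
    for i in range(0, len(items), size):
        yield items[i:i + size]


start = datetime(2023, 1, 1, tzinfo=timezone.utc)
end = datetime(2024, 1, 1, tzinfo=timezone.utc)

for win_start, win_end in month_windows(start, end):
    for group in groups_of(VARIABLES, size=5):
        # One request per month and per group of at most 5 variables keeps each
        # response below the byte limit mentioned above.
        response = requests.get(
            f"{BASE_URL}/feed/data.json",            # assumed endpoint
            params={
                "ids": ",".join(group),              # assumed: comma-separated feed names
                "start": int(win_start.timestamp()),
                "end": int(win_end.timestamp()),
                "interval": 600,                     # assumed 10-minute resolution
                "apikey": API_KEY,
            },
            timeout=30,
        )
        response.raise_for_status()
        samples = response.json()                    # assumed: lists of [timestamp, value] pairs
```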
To set up the project:

- Clone the project from the repository:
  git clone https://github.com/Giudice7/heatpumpmonitoring_scraping.git
- Open a terminal and move into the project folder:
  cd heatpumpmonitoring_scraping
- Check the Python version installed:
  python --version
  This project was developed with Python 3.11, so it is recommended to use the same version to avoid compatibility problems. If you have a different version, download Python 3.11 from the official Python website.
- Create a virtual environment (venv) in the project folder:
  python -m venv venv
- Activate the virtual environment (on Windows):
  venv/Scripts/activate
  On Linux/macOS, run source venv/bin/activate instead.
- Install the dependencies:
  pip install -r requirements.txt
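- Run the scraper. The exact invocation depends on how main.py resolves its imports, so the command below is only the most likely form:
  python src/main.py
  If main.py uses package-style imports (e.g. "from src import ..."), run it as a module instead: python -m src.main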