This repository contains supply and idle data in zones of the Baltic Sea as a result of historical vessel movements from 2020 based on automatic identification system (AIS) data provided by the Danish Maritime Authority (DMA).
The processed data is located within the data folder of this repository. The folder contains three different files.
- The
supply.csv
file shows supply in tons on a weekly basis per zone. Thezone
column corresponds to the respective zone, thecalender_week
column to the calendar week and thesupply_tons
column to `the amount of supply in metric tons (generalized over all different types of cargo). - The
idle.csv
file shows idle times in hours on a weekly basis per zone. Thezone
column corresponds to the respective zone, thecalendar_week
column to the calendar week and theidle_hours
column to the average idle time in hours. - The
zones.csv
file contains a list of all zones. - The
baltic_sea_coastal_line_polygons.pkl
file is a pickle file of a dataframe containing definitions of polygons for the zones found in filezones.csv
.
The supply and idle data is derived from AIS signals provided by the Danish Maritime Authority. Since the AIS signals, besides some specifications about the vessel itself, only contain information about the current geo-position of same, further preprocessing steps are necessary to calculate the supply and demand in the different zones of the Baltic Sea. The following parameters, given in AIS signals, are used for the calculations: timestamp
, latitude
, longitude
, imo
, mmsi
, draught
, destination
. Below screenshot shows the zones, in which we geographically have split up the Baltic Sea. Since we are interested in the supply flow between landmass, we only consider polygons on the coastal line. To get the polygon definition, please load the polygones.pkl
in Python using pickle.
To calculate supply and idle in zones based on AIS, following steps have been performed:
- Get all AIS signals related to one vessel. The
imo
ormmsi
is a unique identifier for a vessel and can be used to consider every vessel one by one. - The time wise ordered AIS data (use
timestamp
), more precisely, the geo-position based onlatitude
andlongitude
illustrate the route a vessel is taking. - To derive supply zones with its idle times, the zone in which the vessel is loading (supply zone) has to be determined. To do so, the vessels
draft
anddestination
give an indication. The point in time, i.e., the AIS signal, in which thedestination
changed compared to the previous signal, denotes the start of a new voyage. Further, if the vesselsdraft
has increased, the vessel was loading cargo, thus, the vessel is in a supply zone and travels towards a new zone to discharge. - Using a vessel database, e.g. Vesselfinder (see below), we determine the vessels maximum capacity in metric tons, its so called deadweight (dwt).
- Since the dwt denotes the capacity of a vessel in metric tons, we take the dwt of the vessel as the amount of cargo, the vessel was loading. Note, it may not reflect one hundred percent of reality.
- Performing such steps for each vessel, yields voyages between supply zones including the amount of cargo transported.
- Aggregating the data on a weekly and zone basis, gives data as provided in the
supply.csv
file. - To determine the idle time in a supply zone, one simply calculates the time from entering a zone until loading. Averaging idle times over all vessels for the same zone at the same time step gives the idle time reflected in the
idle.csv
file.
- Raw AIS Data: https://dma.dk/safety-at-sea/navigational-information/ais-data
- Vessel Database: https://www.vesselfinder.com/de