Time series: Processing Notebook
This Notebook is part of the Time series Data Package of Open Power System Data.

1. About Open Power System Data

This notebook is part of the project Open Power System Data. Open Power System Data develops a platform for free and open data for electricity system modeling. We collect, check, process, document, and provide data that are publicly available but currently inconvenient to use. More info on Open Power System Data:

2. About Jupyter Notebooks and GitHub

This file is a Jupyter Notebook: a file that combines executable code with visualizations and comments in Markdown format, which allows the code to be documented intuitively. We use Jupyter Notebooks for combined coding and documentation, with Python 3 as the programming language. All Notebooks are stored on GitHub, a platform for software development, and are publicly available. More information on our IT concept can be found here. See also our step-by-step manual on how to use the data platform.

3. About this data package

We provide data in different chunks, or data packages. The one you are looking at right now, Time series, contains various kinds of time series data in 15-minute, 30-minute or 60-minute resolution, namely:

  • electricity consumption (load)
  • wind and solar power: capacity, generation forecast, actual generation
  • day-ahead spot prices

The main focus of this data package is German data, but we include data from other countries wherever possible. The time series become available at different points in time, depending on the source. The full dataset is only available from 2015 onwards. The data have been downloaded from the sources, resampled and merged into one large CSV file with hourly resolution. Additionally, the data available at a higher resolution (some renewables in-feed, 15 minutes) are provided in a separate file.
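The resample-and-merge step described above can be sketched with pandas as follows. This is a minimal illustration, not the project's actual processing code: the values are made up, the column names are assumed to follow a region_variable_attribute pattern, and averaging the four quarter-hour values is only one plausible way to bring a power series to hourly resolution.

```python
import pandas as pd

# Hypothetical 15-minute solar in-feed (MW) covering two hours
idx_15 = pd.date_range("2015-01-01 00:00", periods=8, freq="15min", tz="UTC")
solar_15 = pd.DataFrame(
    {"DE_solar_generation_actual": [0.0, 4.0, 8.0, 12.0, 16.0, 20.0, 24.0, 28.0]},
    index=idx_15,
)

# Hypothetical hourly load (MW) for the same two hours
idx_60 = pd.date_range("2015-01-01 00:00", periods=2, freq="60min", tz="UTC")
load_60 = pd.DataFrame(
    {"DE_load_entsoe_transparency": [40000.0, 41000.0]},
    index=idx_60,
)

# Bring the 15-minute series to hourly resolution by averaging each hour
solar_60 = solar_15.resample("60min").mean()

# Merge both series on the shared hourly timestamp index
hourly = load_60.join(solar_60, how="outer")
print(hourly)
```

The untouched `solar_15` frame corresponds to what would go into the separate higher-resolution file, while `hourly` corresponds to the merged hourly table.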

4. Data sources

The main data sources are the various European Transmission System Operators (TSOs) and the ENTSO-E Data Portal. Where no data are available from the TSOs directly, data are taken from the ENTSO-E Transparency Platform. A complete list of data sources is provided on the data package information website. The sources are also listed in the JSON file that contains all metadata.

5. Naming conventions

In [1]:
import pandas as pd
pd.read_csv('input/notation.csv', index_col=list(range(4)))  # display the notation table; all four columns form the index
| region | variable | attribute | Explanation |
|---|---|---|---|
| ISO-2 digit country code and name of control balancing area if applicable, e.g. DE_amprion | load | entsoe_power_statistics | Hourly load from ENTSO-E Data Portal |
| | | entsoe_transparency | Hourly load from ENTSO-E Transparency Platform |
| | price | day_ahead | Day-ahead spot price |
| | solar / wind-onshore / wind-offshore | generation_actual | Electricity produced by solar power plants |
| | | generation_forecast | Day-ahead generation forecast |
| | | capacity | Installed capacity (actual availability not accounted for) |
| | | profile | Share of installed capacity producing |
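To illustrate how the region / variable / attribute scheme can be used in practice, the sketch below builds a small DataFrame whose columns carry these three levels. The values are invented, the MultiIndex layout is an assumption rather than the project's documented format, and computing the profile as actual generation divided by installed capacity is one reading of "share of installed capacity producing".

```python
import pandas as pd

# Column labels following the region / variable / attribute scheme
columns = pd.MultiIndex.from_tuples(
    [
        ("DE_amprion", "load", "entsoe_transparency"),
        ("DE_amprion", "solar", "generation_actual"),
        ("DE_amprion", "solar", "capacity"),
    ],
    names=["region", "variable", "attribute"],
)

# Two hypothetical hourly observations (MW)
index = pd.date_range("2015-01-01", periods=2, freq="60min", tz="UTC")
data = pd.DataFrame(
    [[100.0, 5.0, 50.0], [110.0, 10.0, 50.0]],
    index=index,
    columns=columns,
)

# Assumed definition: profile = actual generation / installed capacity
profile = (
    data[("DE_amprion", "solar", "generation_actual")]
    / data[("DE_amprion", "solar", "capacity")]
)

# Select every solar column, whatever the region or attribute
solar = data.xs("solar", axis=1, level="variable")
print(profile)
print(solar)
```

Keeping the three levels separate makes it possible to select by any one of them (for example all `capacity` columns) without parsing column names.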

6. License

This notebook as well as all other documents in this repository is published under the MIT License.