Improve performance v0.11.0 #1015

Draft: wants to merge 2 commits into main from 1014-improve-performance-v0_11_0

Conversation

@datejada (Member) commented on Feb 5, 2025

  • Use JuMP.add_to_expression! and dataframe subsetting when creating the constraints over clustered years (inter-temporal constraints) to improve efficiency (a rough sketch of the difference is below).
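A minimal sketch of why this helps, assuming a toy model (the function and variable names are illustrative, not the code in this PR): += builds a brand-new affine expression on every iteration, while JuMP.add_to_expression! mutates the existing one in place.

```julia
using JuMP

# Accumulating with `+=` allocates a new AffExpr at every iteration.
function accumulate_with_plus(vars)
    expr = AffExpr(0.0)
    for v in vars
        expr += 2.0 * v
    end
    return expr
end

# add_to_expression! mutates the same AffExpr in place, avoiding those allocations.
function accumulate_in_place(vars)
    expr = AffExpr(0.0)
    for v in vars
        JuMP.add_to_expression!(expr, 2.0, v)
    end
    return expr
end

model = Model()
@variable(model, x[1:10_000])
@time accumulate_with_plus(x)   # many allocations, grows with the number of terms
@time accumulate_in_place(x)    # same expression, far fewer allocations
```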

Related issues

Closes #1014

Checklist

  • I am following the contributing guidelines

  • Tests are passing

  • Lint workflow is passing

  • Docs were updated and workflow is passing

@datejada added the "benchmark PR only - Run benchmark on PR" label on Feb 5, 2025
github-actions bot (Contributor) commented on Feb 5, 2025

Benchmark Results

Benchmark in progress...

codecov bot commented on Feb 5, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 95.24%. Comparing base (d7cba2a) to head (c99898e).

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1015      +/-   ##
==========================================
+ Coverage   95.22%   95.24%   +0.01%     
==========================================
  Files          29       29              
  Lines        1152     1156       +4     
==========================================
+ Hits         1097     1101       +4     
  Misses         55       55              

☔ View full report in Codecov by Sentry.

@datejada force-pushed the 1014-improve-performance-v0_11_0 branch from 48bfb7a to f3cda08 on February 5, 2025 11:47
@abelsiqueira (Member) commented:

I don't know what the error in the benchmark is, but it doesn't seem related to the cancellation due to exceeded time. Does the benchmark run locally?

@datejada force-pushed the 1014-improve-performance-v0_11_0 branch from f3cda08 to c99898e on February 5, 2025 14:24
@datejada (Member, Author) commented on Feb 5, 2025

> I don't know what the error in the benchmark is, but it doesn't seem related to the cancellation due to exceeded time. Does the benchmark run locally?

I forgot to update the duration for the incoming inflows. I fixed it, and it runs locally now.

The results in v0.10.4 are here: #1013 (comment)

I expect this benchmark to take around the same time, which is good since we achieve more or less the same performance we got in that version, but... 20 minutes to create the model is too much :/ Especially since the hourly problem for the whole year (a bigger problem) takes around 5 minutes to create. The bottleneck is the function add_expression_terms_over_clustered_year_constraints!, which is too slow... I wonder what else we can do to improve its performance. Any ideas?

@datejada requested a review from abelsiqueira on February 5, 2025 14:34
@abelsiqueira (Member) commented:

That's the counterpart of https://github.com/TulipaEnergy/TulipaEnergyModel.jl/pull/987/files#diff-43e9d15ce9a457e188412a614efc15efee4860e85525d1b47ef2892074e90d43, right? Hopefully something similar to what was done there will work, i.e., change the loops to use DuckDB tables and explicitly mark some parts with types. I used Cthulhu and @profview while working on that PR to help figure out where to change things.
However, it will be more complicated here since there is also a df_map, so it's not immediately clear how to change things. I can start looking into this to figure out more.
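For reference, a rough sketch of that profiling workflow; the arguments passed to add_expression_terms_over_clustered_year_constraints! below are placeholders, not the function's real signature.

```julia
using ProfileView  # provides @profview (also bundled with the VS Code Julia extension)
using Cthulhu      # provides @descend for interactive inspection of inferred types

# Flame graph showing where time is actually spent.
# NOTE: the arguments are placeholders, not the function's real signature.
@profview add_expression_terms_over_clustered_year_constraints!(model, constraints)

# Descend through the call tree to spot type instabilities,
# i.e., the parts worth annotating with explicit types.
@descend add_expression_terms_over_clustered_year_constraints!(model, constraints)
```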

@datejada (Member, Author) commented on Feb 5, 2025

> JuMP.add_to_expression!

So, to recap (for myself and to keep things clear), we have identified two bottlenecks:

  1. The +=, which is solved using JuMP.add_to_expression!. Yesterday, Joaquim and Oscar commented in the JuMP energy models benchmark meeting, explaining why += is slow and reinforcing that we should improve efficiency using JuMP.add_to_expression!.
  2. The filtering/subsetting of the dataframes, which might improve with a DuckDB version, as in the representative period's counterpart.

The first point leaves us at a similar performance level to v0.10.4 (which is still slow for the use cases we are solving now).

The second point is challenging, but it feels like the way to go (a rough sketch of the idea is below). I will try to think about it, too, but if you have time between the multi-year refactor and this, it would be appreciated.

Thanks!
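For concreteness, a minimal sketch of what pushing the subsetting into DuckDB could look like; the table name var_flow and its columns are invented for illustration and are not TulipaEnergyModel's actual schema.

```julia
using DataFrames
using DuckDB

# Hypothetical stand-in for one of the model's dataframes.
df = DataFrame(
    asset = ["wind", "wind", "ccgt"],
    year  = [2030, 2050, 2030],
    value = [1.0, 2.0, 3.0],
)

con = DBInterface.connect(DuckDB.DB)
DuckDB.register_data_frame(con, df, "var_flow")

# Instead of filtering/subsetting the dataframe inside a Julia loop,
# push the subsetting down into a single SQL query over the registered table.
subset_2030 = DataFrame(
    DBInterface.execute(con, "SELECT asset, year, value FROM var_flow WHERE year = 2030"),
)
```

The idea is that one query over a registered table replaces many per-group dataframe filters inside the Julia loop, letting DuckDB do the subsetting.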

@abelsiqueira (Member) commented:

  1. Yes.
  2. Hopefully, yes. Either way, part of the refactor involves this function, so changing it to DuckDB will be necessary. The rep_period counterpart was done more with the objective of refactoring, and some investigation led to it being a bit faster. That might be the case here as well.

I will try to spend some time on it
