A Multi-Stage Warm-Start Deep Learning Framework for Unit Commitment

2026-04-23 • Artificial Intelligence

Artificial Intelligence

AI summaryⓘ

The authors address the challenge of balancing electricity supply and demand by solving a complex scheduling problem called Unit Commitment (UC), which becomes harder as renewable energy and storage technologies are added to the grid. They develop a new method using a transformer-based neural network to predict multi-day generator schedules quickly. To ensure these predictions meet physical rules, the authors combine the model with rules that fix issues and then use these better guesses to help the traditional solver find a solution faster. Testing on a small system showed their method always found valid schedules and often saved time and cost compared to standard approaches.

Unit CommitmentMixed-Integer Linear ProgrammingTransformerSelf-AttentionRenewable Energy IntegrationLong Duration StorageFeasibilityWarm StartVariable FixationElectric Grid Reliability

Authors

Muhy Eddin Za'ter, Anna Van Boven, Bri-Mathias Hodge, Kyri Baker

Abstract

Maintaining instantaneous balance between electricity supply and demand is critical for reliability and grid instability. System operators achieve this through solving the task of Unit Commitment (UC),ca high dimensional large-scale Mixed-integer Linear Programming (MILP) problem that is strictly and heavily governed by the grid physical constraints. As grid integrate variable renewable sources, and new technologies such as long duration storage in the grid, UC must be optimally solved for multi-day horizons and potentially with greater frequency. Therefore, traditional MILP solvers increasingly struggle to compute solutions within these tightening operational time limits. To bypass these computational bottlenecks, this paper proposes a novel framework utilizing a transformer-based architecture to predict generator commitment schedules over a 72-hour horizon. Also, because raw predictions in highly dimensional spaces often yield physically infeasible results, the pipeline integrates the self-attention network with deterministic post-processing heuristics that systematically enforce minimum up/down times and minimize excess capacity. Finally, these refined predictions are utilized as a warm start for a downstream MILP solver, while employing a confidence-based variable fixation strategy to drastically reduce the combinatorial search space. Validated on a single-bus test system, the complete multi-stage pipeline achieves 100\% feasibility and significantly accelerates computation times. Notably, in approximately 20\% of test instances, the proposed model reached a feasible operational schedule with a lower overall system cost than relying solely on the solver.

View PDFOpen arXiv