Evaluation of reinforcement learning and model predictive control for apartment heating with heat pump and water storage tank

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorTarvainen, Sami
dc.contributor.authorHirvijoki, Eero
dc.contributor.authorLaukkanen, Timo
dc.contributor.departmentDepartment of Energy and Mechanical Engineeringen
dc.contributor.groupauthorEnergy Conversion and Systemsen
dc.contributor.organizationDepartment of Energy and Mechanical Engineering
dc.date.accessioned2026-02-04T06:36:19Z
dc.date.available2026-02-04T06:36:19Z
dc.date.issued2026-03-30
dc.descriptionPublisher Copyright: © 2026 The Authors
dc.description.abstractOptimising apartment heating systems is becoming increasingly crucial due to growing use of heat pumps and heat storages. This study compares reinforcement learning (RL) and model predictive control (MPC) to optimise and control a simulated apartment heating system. We explore various RL designs, including binary and continuous action spaces, different reward functions, and three storage sizes. We find that MPC outperforms RL when the optimisation period aligns with the actual problem. This is shown in the lower electricity costs of MPC compared to RL with small and base storage sizes. However, the two methods yielded similar electricity costs with the large storage size. We also find that the RL design significantly affects its performance and robustness. The RL model with binary actions and a reward function promoting the active use of storage and profit maximisation outperforms other RL configurations. In turn, a reward function representing the actual problem of cost minimisation was found to be ineffective for agent training. Future studies that compare the two methods for the optimisation of heating systems with long-term storage could focus on extending the prediction horizon or enhancing the terminal cost term in MPC.en
dc.description.versionPeer revieweden
dc.format.extent11
dc.format.mimetypeapplication/pdf
dc.identifier.citationTarvainen, S, Hirvijoki, E & Laukkanen, T 2026, 'Evaluation of reinforcement learning and model predictive control for apartment heating with heat pump and water storage tank', Journal of Energy Storage, vol. 152, 120509. https://doi.org/10.1016/j.est.2026.120509en
dc.identifier.doi10.1016/j.est.2026.120509
dc.identifier.issn2352-152X
dc.identifier.issn2352-1538
dc.identifier.otherPURE UUID: ee000b0c-233b-4300-8893-ca1be4a4f7a7
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/ee000b0c-233b-4300-8893-ca1be4a4f7a7
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/208271470/1-s2.0-S2352152X26001738-main.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/143033
dc.identifier.urnURN:NBN:fi:aalto-202602042395
dc.language.isoenen
dc.publisherElsevier
dc.relation.fundinginfoThis project has received funding from the European Union – NextGenerationEU instrument and is funded by the Research Council of Finland under grant number 353299 .
dc.relation.ispartofseriesJournal of Energy Storageen
dc.relation.ispartofseriesVolume 152en
dc.rightsopenAccessen
dc.rightsCC BY
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subject.keywordApartment space heating
dc.subject.keywordHeat pump
dc.subject.keywordHeat storage
dc.subject.keywordModel predictive control
dc.subject.keywordProximal policy optimisation
dc.subject.keywordReinforcement learning
dc.titleEvaluation of reinforcement learning and model predictive control for apartment heating with heat pump and water storage tanken
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1-s2.0-S2352152X26001738-main.pdf
Size:
1.8 MB
Format:
Adobe Portable Document Format