A simulation environment for training a reinforcement learning agent trading a battery storage

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorAaltonen, Harrien_US
dc.contributor.authorSierla, Seppoen_US
dc.contributor.authorSubramanya, Rakshithen_US
dc.contributor.authorVyatkin, Valeriyen_US
dc.contributor.departmentDepartment of Electrical Engineering and Automationen
dc.contributor.groupauthorInformation Technologies in Industrial Automationen
dc.contributor.organizationDepartment of Electrical Engineering and Automationen_US
dc.date.accessioned2021-09-22T06:31:26Z
dc.date.available2021-09-22T06:31:26Z
dc.date.issued2021-09-06en_US
dc.descriptionFunding Information: Funding: This research was supported by Business Finland grant 7439/31/2018. Publisher Copyright: © 2021 by the authors. Licensee MDPI, Basel, Switzerland.
dc.description.abstractBattery storages are an essential element of the emerging smart grid. Compared to other distributed intelligent energy resources, batteries have the advantage of being able to rapidly react to events such as renewable generation fluctuations or grid disturbances. There is a lack of research on ways to profitably exploit this ability. Any solution needs to consider rapid electrical phenomena as well as the much slower dynamics of relevant electricity markets. Reinforcement learning is a branch of artificial intelligence that has shown promise in optimizing complex problems involving uncertainty. This article applies reinforcement learning to the problem of trading batteries. The problem involves two timescales, both of which are important for profitability. Firstly, trading the battery capacity must occur on the timescale of the chosen electricity markets. Secondly, the real-time operation of the battery must ensure that no financial penalties are incurred from failing to meet the technical specification. The trading‐related decisions must be done under uncertainties, such as unknown future market prices and unpredictable power grid disturbances. In this article, a simulation model of a battery system is proposed as the environment to train a reinforcement learning agent to make such decisions. The system is demonstrated with an application of the battery to Finnish primary frequency reserve markets.en
dc.description.versionPeer revieweden
dc.format.extent20
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationAaltonen, H, Sierla, S, Subramanya, R & Vyatkin, V 2021, ' A simulation environment for training a reinforcement learning agent trading a battery storage ', Energies, vol. 14, no. 17, 5587 . https://doi.org/10.3390/en14175587en
dc.identifier.doi10.3390/en14175587en_US
dc.identifier.issn1996-1073
dc.identifier.otherPURE UUID: 72b265af-1d42-4f6f-9fa2-8515f36e88caen_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/72b265af-1d42-4f6f-9fa2-8515f36e88caen_US
dc.identifier.otherPURE LINK: http://www.scopus.com/inward/record.url?scp=85114611521&partnerID=8YFLogxKen_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/67562242/ELEC_Aaltonen_etal_A_Simulation_Environment_for_Training_a_Reinforcement_Learning_Energies_2021.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/110096
dc.identifier.urnURN:NBN:fi:aalto-202109229319
dc.language.isoenen
dc.publisherMDPI AG
dc.relation.ispartofseriesEnergiesen
dc.relation.ispartofseriesVolume 14, issue 17en
dc.rightsopenAccessen
dc.subject.keywordArtificial intelligenceen_US
dc.subject.keywordBatteryen_US
dc.subject.keywordElectricity marketen_US
dc.subject.keywordFrequency containment reserveen_US
dc.subject.keywordFrequency reserveen_US
dc.subject.keywordReal‐timeen_US
dc.subject.keywordReinforcement learningen_US
dc.subject.keywordSimulationen_US
dc.subject.keywordTimescaleen_US
dc.titleA simulation environment for training a reinforcement learning agent trading a battery storageen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files