Reinforcement Learning Framework for Delay Sensitive Energy Harvesting Wireless Sensor Networks
Loading...
Access rights
openAccess
acceptedVersion
URL
Journal Title
Journal ISSN
Volume Title
A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä
This publication is imported from Aalto University research portal.
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
View publication in the Research portal (opens in new window)
View/Open full text file from the Research portal (opens in new window)
Other link related to publication (opens in new window)
Authors
Date
2021-03-01
Major/Subject
Mcode
Degree programme
Language
en
Pages
11
Series
IEEE Sensors Journal, Volume 21, issue 5, pp. 7103-7113
Abstract
A multi-hop energy harvesting wireless sensor network (EH-WSNs) is a key enabler for future communication systems such as the internet-of-things. Optimal power management and routing selection are important for the operation and successful deployment of EH-WSNs. Characterizing the optimal policies increases significantly with the number of nodes in the network. In this paper, optimal control policy is devised based on minimum-delay transmission in a multi-hop EH-WSN using reinforcement learning (RL). The WSN consists of M EH sensor nodes aiming to transmit their data to a sink node with a minimum delay. Each sensor node is equipped with a battery of limited capacity to save the harvested energy and a data buffer of limited size to store both the sensed and relayed data from neighboring nodes. Centralized and distributed RL algorithms are considered for EH-WSNs. In the centralized RL algorithm the control action is taken at a central unit using the state information of all sensor nodes. In the distributed RL algorithm the control action is taken locally at each sensor node using its state of information and the state information of neighboring nodes. The proposed RL algorithms are based on the state-action-reward-state-action (SARSA) algorithm. Simulation results demonstrate the merits of the proposed algorithms.Description
Keywords
action-value-function approximation, energy harvesting, reinforcement learning, SARSA, Wireless sensor network
Other note
Citation
Al-Tous, H & Barhumi, I 2021, ' Reinforcement Learning Framework for Delay Sensitive Energy Harvesting Wireless Sensor Networks ', IEEE Sensors Journal, vol. 21, no. 5, 9292079, pp. 7103-7113 . https://doi.org/10.1109/JSEN.2020.3044049