Bandit-Based Power Control in Full-Duplex Cooperative Relay Networks with Strict-Sense Stationary and Non-Stationary Wireless Communication Channels

Loading...
Thumbnail Image

Access rights

openAccess
publishedVersion

URL

Journal Title

Journal ISSN

Volume Title

A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä

Date

2022-02-28

Major/Subject

Mcode

Degree programme

Language

en

Pages

13

Series

IEEE Open Journal of the Communications Society, Volume 3, pp. 366-378

Abstract

Full-duplex relaying is an enabling technique of sixth generation (6G) mobile networks, promising tremendous rate and spectral efficiency gains. In order to improve the performance of full-duplex communications, power control is a viable way of avoiding excessive loop interference at the relay. Unfortunately, power control requires channel state information of source-relay, relay-destination and loop interference channels, thus resulting in increased overheads. Aiming to offer a low-complexity alternative for power control in such networks, we adopt reward-based learning in the sense of multi-armed bandits. More specifically, we present bandit-based power control, relying on acknowledgements/negative-acknowledgements observations by the relay. Our distributed algorithms avoid channel state information acquisition and exchange, and can alleviate the impact of outdated channel state information. Two cases are examined regarding the channel statistics of the wireless network, namely, strict-sense stationary and non-stationary channels. For the latter, a sliding window approach is adopted to further improve the performance. Performance evaluation highlights a performance-complexity trade-off, compared to optimal power control with full channel knowledge and significant gains over cases considering channel estimation and feedback overheads, outdated channel knowledge, no power control and random power level selection. Finally, it is shown that the sliding-window bandit-based algorithm provides improved performance in non-stationary settings by efficiently adapting to abrupt changes of the wireless channels.

Description

Publisher Copyright: Author

Keywords

Channel estimation, Encoding, Full-duplex relaying, multi-armed bandits, non-stationary wireless channels, outdated CSI, power control, Power control, reinforcement learning, Relay networks (telecommunication), Resource management, sliding-window, upper confidence bound policies., Wireless communication, Wireless sensor networks

Other note

Citation

Nomikos, N, Talebi, M S, Charalambous, T & Wichman, R 2022, ' Bandit-Based Power Control in Full-Duplex Cooperative Relay Networks with Strict-Sense Stationary and Non-Stationary Wireless Communication Channels ', IEEE Open Journal of the Communications Society, vol. 3, pp. 366-378 . https://doi.org/10.1109/OJCOMS.2022.3154292