Abstract
In this paper, we propose an intelligent relay selection scheme employing deep reinforcement learning for a wireless powered cooperative network. We formulate the given problem as a Markov decision process with an unknown transitional probability between states. Therefore, a model-free off-policy relay selection model is proposed. The given model was deployed using a deep Q-network, with an updated relay selection process. Using channel characteristics, we find inaccessible nodes to form a pool of relays available for transmission and encourage the neural network to choose them. In addition, we propose a novel reward policy to train the model that is based on stored energy levels on the relays and promotes the system to expend energy. We numerically quantity the network performance in terms of outage probability and energy outage probability and compare them with the basic Q-learning.
| Original language | English |
|---|---|
| Title of host publication | 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9798350339109 |
| DOIs | |
| Publication status | Published - 2023 |
| Event | 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023 - Istanbul, Turkey Duration: Jun 5 2023 → Jun 8 2023 |
Publication series
| Name | 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023 |
|---|
Conference
| Conference | 2023 International Balkan Conference on Communications and Networking, BalkanCom 2023 |
|---|---|
| Country/Territory | Turkey |
| City | Istanbul |
| Period | 6/5/23 → 6/8/23 |
Funding
VI. ACKNOWLEDGMENT This research is funded by Nazarbayev University under Collaborative Research Program Grant no. 11022021CRP1513 (PI: Galymzhan Nauryzbayev).
Keywords
- outage probability (OP)
- Q-learning
- reinforcement learning (RL)
- Relay selection
- wireless powered communication network (WPCN)
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Networks and Communications
- Hardware and Architecture
- Information Systems
- Safety, Risk, Reliability and Quality
- Instrumentation
Fingerprint
Dive into the research topics of 'Two-Step Deep Reinforcement Q-Learning based Relay Selection in Cooperative WPCNs'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS