TY - JOUR
T1 - Intelligent IoT Connectivity
T2 - Deep Reinforcement Learning Approach
AU - Kwon, Minhae
AU - Lee, Juhyeon
AU - Park, Hyunggon
N1 - Funding Information:
Manuscript received September 21, 2019; accepted October 24, 2019. Date of publication October 29, 2019; date of current version February 5, 2020. This work was supported in part by the Institute of Information & Communications Technology Planning & Evaluation (IITP) through the Korea Government (MSIT) (Supervised Agile Machine Learning Techniques for Network Automation based on Network Data Analytics Function) under Grant 2019-0-00024, in part by the National Research Foundation of Korea (NRF) through the Korea Government (MSIT) under Grant NRF-2017R1A2B4005041, in part by NSF CAREER IOS-1552868, and in part by the McNair Foundation. This article was presented in part at the Conference on Neural Information Processing Systems (NeurIPS) Deep Reinforcement Learning Workshop, Montreal, Canada, December 2018. The associate editor coordinating the review of this article and approving it for publication was Prof. Okyay Kaynak. (Corresponding author: Hyunggon Park.) M. Kwon was with the Department of Electronic and Electrical Engineering, Ewha Womans University, Seoul 03760, South Korea. She is now with the Department of Electrical and Computer Engineering, Rice University, Houston, TX 77005 USA, and also with the Department of Neuroscience, Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, TX 77030 USA (e-mail: minhae.kwon@rice.edu).
Publisher Copyright:
© 2001-2012 IEEE.
PY - 2020/3/1
Y1 - 2020/3/1
N2 - In this paper, we propose a distributed solution to design a multi-hop ad hoc Internet of Things (IoT) network where mobile IoT devices strategically determine their wireless transmission ranges based on a deep reinforcement learning approach. We consider scenarios where only a limited networking infrastructure is available but a large number of IoT devices are deployed in building a multi-hop ad hoc network to deliver source data to the destination. An IoT device is considered as a decision-making agent that strategically determines its transmission range in a way that maximizes network throughput while minimizing the corresponding transmission power consumption. Each IoT device collects information from its partial observations and learns its environment through a sequence of experiences. Hence, the proposed solution requires only a minimal amount of information from the system. We show that the actions that the IoT devices take from its policy are determined as to activate or inactivate its transmission, i.e., only necessary relay nodes are activated with the maximum transmit power, and nonessential nodes are deactivated to minimize power consumption. Using extensive experiments, we confirm that the proposed solution builds a network with higher network performance than the current state-of-the-art solutions in terms of system goodput and connectivity ratio.
AB - In this paper, we propose a distributed solution to design a multi-hop ad hoc Internet of Things (IoT) network where mobile IoT devices strategically determine their wireless transmission ranges based on a deep reinforcement learning approach. We consider scenarios where only a limited networking infrastructure is available but a large number of IoT devices are deployed in building a multi-hop ad hoc network to deliver source data to the destination. An IoT device is considered as a decision-making agent that strategically determines its transmission range in a way that maximizes network throughput while minimizing the corresponding transmission power consumption. Each IoT device collects information from its partial observations and learns its environment through a sequence of experiences. Hence, the proposed solution requires only a minimal amount of information from the system. We show that the actions that the IoT devices take from its policy are determined as to activate or inactivate its transmission, i.e., only necessary relay nodes are activated with the maximum transmit power, and nonessential nodes are deactivated to minimize power consumption. Using extensive experiments, we confirm that the proposed solution builds a network with higher network performance than the current state-of-the-art solutions in terms of system goodput and connectivity ratio.
KW - Intelligent IoT connectivity
KW - deep reinforcement learning
KW - mobile relay networks
KW - network formation
KW - network topology design
KW - wireless ad hoc networks
UR - http://www.scopus.com/inward/record.url?scp=85079682951&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2019.2949997
DO - 10.1109/JSEN.2019.2949997
M3 - Article
AN - SCOPUS:85079682951
SN - 1530-437X
VL - 20
SP - 2782
EP - 2791
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
IS - 5
M1 - 8886442
ER -