A deep reinforcement learning-based multi-optimality routing scheme for dynamic IoT networks

Peizhuang Cong; Yuchao Zhang; Zheli Liu; Thar Baker; Hissam Tawfik; Wendong Wang; Ke Xu; Ruidong Li; Fuliang Li

doi:10.1016/j.comnet.2021.108057

School of Arch, Tech and Eng

Research output: Contribution to journal › Article › peer-review

Abstract

With the development of Internet of Things (IoT) and 5G technologies, more and more applications, such as autonomous vehicles and tele-medicine, are becoming more sensitive to network latency and accuracy, which requires routing schemes to be more flexible and efficient. To meet this urgent need, learning-based routing strategies are emerging as strong candidate solutions, with the advantages of high flexibility and accuracy. These strategies can be divided into two categories, centralized and distributed, which offer the advantages of high precision and high efficiency, respectively. However, routing becomes more complex in dynamic IoT networks, where the link connections and access states are time-varying; these learning-based routing mechanisms must therefore be able to adapt to network changes in real time. In this paper, we designed and implemented both centralized and distributed Reinforcement Learning-based Routing schemes combined with Multi-optimality routing criteria (RLR-M). Through a series of experiments and a comprehensive analysis of the results, we concluded that the centralized scheme is better suited to dynamic networks due to its faster reconvergence (2.2× over distributed), while the distributed scheme is better positioned to handle large-scale networks through its high scalability (1.6× over centralized). Moreover, the multi-optimality routing scheme is implemented through model fusion, which is more flexible than traditional strategies and is therefore better placed to meet the needs of IoT.

Original language	English
Article number	108057
Journal	Computer Networks
Volume	192
DOIs	https://doi.org/10.1016/j.comnet.2021.108057
Publication status	Published - 3 Apr 2021

Cite this

@article{f7530816dbfa49e7948f1428655933e2,

title = "A deep reinforcement learning-based multi-optimality routing scheme for dynamic IoT networks",

abstract = "With the development of Internet of Things (IoT) and 5G technologies, more and more applications, such as autonomous vehicles and tele-medicine, are becoming more sensitive to network latency and accuracy, which requires routing schemes to be more flexible and efficient. To meet this urgent need, learning-based routing strategies are emerging as strong candidate solutions, with the advantages of high flexibility and accuracy. These strategies can be divided into two categories, centralized and distributed, which offer the advantages of high precision and high efficiency, respectively. However, routing becomes more complex in dynamic IoT networks, where the link connections and access states are time-varying; these learning-based routing mechanisms must therefore be able to adapt to network changes in real time. In this paper, we designed and implemented both centralized and distributed Reinforcement Learning-based Routing schemes combined with Multi-optimality routing criteria (RLR-M). Through a series of experiments and a comprehensive analysis of the results, we concluded that the centralized scheme is better suited to dynamic networks due to its faster reconvergence (2.2× over distributed), while the distributed scheme is better positioned to handle large-scale networks through its high scalability (1.6× over centralized). Moreover, the multi-optimality routing scheme is implemented through model fusion, which is more flexible than traditional strategies and is therefore better placed to meet the needs of IoT.",

author = "Peizhuang Cong and Yuchao Zhang and Zheli Liu and Thar Baker and Hissam Tawfik and Wendong Wang and Ke Xu and Ruidong Li and Fuliang Li",

year = "2021",

month = apr,

day = "3",

doi = "10.1016/j.comnet.2021.108057",

language = "English",

volume = "192",

journal = "Computer Networks",

publisher = "Elsevier",

}

TY - JOUR

T1 - A deep reinforcement learning-based multi-optimality routing scheme for dynamic IoT networks

AU - Cong, Peizhuang

AU - Zhang, Yuchao

AU - Liu, Zheli

AU - Baker, Thar

AU - Tawfik, Hissam

AU - Wang, Wendong

AU - Xu, Ke

AU - Li, Ruidong

AU - Li, Fuliang

PY - 2021/4/3

Y1 - 2021/4/3

AB - With the development of Internet of Things (IoT) and 5G technologies, more and more applications, such as autonomous vehicles and tele-medicine, are becoming more sensitive to network latency and accuracy, which requires routing schemes to be more flexible and efficient. To meet this urgent need, learning-based routing strategies are emerging as strong candidate solutions, with the advantages of high flexibility and accuracy. These strategies can be divided into two categories, centralized and distributed, which offer the advantages of high precision and high efficiency, respectively. However, routing becomes more complex in dynamic IoT networks, where the link connections and access states are time-varying; these learning-based routing mechanisms must therefore be able to adapt to network changes in real time. In this paper, we designed and implemented both centralized and distributed Reinforcement Learning-based Routing schemes combined with Multi-optimality routing criteria (RLR-M). Through a series of experiments and a comprehensive analysis of the results, we concluded that the centralized scheme is better suited to dynamic networks due to its faster reconvergence (2.2× over distributed), while the distributed scheme is better positioned to handle large-scale networks through its high scalability (1.6× over centralized). Moreover, the multi-optimality routing scheme is implemented through model fusion, which is more flexible than traditional strategies and is therefore better placed to meet the needs of IoT.

U2 - 10.1016/j.comnet.2021.108057

DO - 10.1016/j.comnet.2021.108057

M3 - Article

VL - 192

JO - Computer Networks

JF - Computer Networks

M1 - 108057

ER -
