Lyapunov-Guided Deep Reinforcement Learning for Stable Low-Latency Task Offloading in Mobile Edge Computing

Riyadh A S Asbeetah; Alsanousi M  Aboujanah; Ramdan AM Khalifa

doi:10.65421/jibas.v2i2.99

Authors

Riyadh A S Asbeetah Department of Electrical and Electronic Technologies, Higher Institute of Science and Technology, Tamzawah Alshati, Alshati, Libya Author
Alsanousi M Aboujanah Department of Electrical and Electronic Technologies, Higher Institute of Science and Technology, Tamzawah Alshati, Alshati, Libya Author
Ramdan AM Khalifa Department of Electrical and Electronic Technologies, Higher Institute of Science and Technology, Suk-Algumaa, Tripoli, Libya Author

DOI:

https://doi.org/10.65421/jibas.v2i2.99

Keywords:

Mobile Edge Computing (MEC), Deep Reinforcement Learning, Lyapunov Optimization, Task Offloading, DDPG, Queue Stability, 5G, Internet of Things, Latency Minimization

Abstract

Fifth-generation (5G) networks and Internet of Things (IoT) face critical challenges in balancing latency minimization and energy consumption while ensuring data queue stability in dynamic environments. This paper proposes an innovative hybrid framework that integrates stochastic optimization based on Lyapunov theory with the Deep Deterministic Policy Gradient (DDPG) algorithm—termed LG-DDPG (Lyapunov-Guided DDPG). The offloading problem is formulated as a constrained Markov Decision Process (MDP), and a theoretical upper bound for the system cost is derived. The proposed framework employs the Drift-Plus-Penalty (DPP) technique to decouple the long-term stability constraint into instantaneous subproblems, which are then solved by a DDPG-based actor-critic architecture with hidden layers (256, 128, 64) using ReLU and Tanh activations. Comprehensive simulations—averaged over 10 independent runs—demonstrate that LG-DDPG achieves a 35–45% reduction in total system cost compared to state-of-the-art baselines, with an average latency of 45.2 ± 1.3 ms and energy consumption of 2.1 ± 0.1 J, outperforming DRL-only (52.8 ms), Lyapunov-only (58.3 ms), and PSO (65.7 ms) approaches. The system scales linearly to 100+ devices with O(N) complexity, with rigorous mathematical proofs confirming queue stability and neural network convergence.

Lyapunov-Guided Deep Reinforcement Learning for Stable Low-Latency Task Offloading in Mobile Edge Computing

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

License

How to Cite

Most read articles by the same author(s)

INDEXED IN CROSSREF

Language

Browse

Information

Make a Submission

Developed By

Latest publications

Navigation

Connect

Stay Updated