Structured Reinforcement Learning for Media Streaming at the Wireless Edge

Bura, Archana; Bobbili, Sarat Chandra; Rameshkumar, Shreyas; Rengarajan, Desik; Kalathil, Dileep; Shakkottai, Srinivas

Electrical Engineering and Systems Science > Systems and Control

arXiv:2404.07315 (eess)

[Submitted on 10 Apr 2024 (v1), last revised 16 Apr 2024 (this version, v2)]

Title:Structured Reinforcement Learning for Media Streaming at the Wireless Edge

Authors:Archana Bura, Sarat Chandra Bobbili, Shreyas Rameshkumar, Desik Rengarajan, Dileep Kalathil, Srinivas Shakkottai

View PDF HTML (experimental)

Abstract:Media streaming is the dominant application over wireless edge (access) networks. The increasing softwarization of such networks has led to efforts at intelligent control, wherein application-specific actions may be dynamically taken to enhance the user experience. The goal of this work is to develop and demonstrate learning-based policies for optimal decision making to determine which clients to dynamically prioritize in a video streaming setting. We formulate the policy design question as a constrained Markov decision problem (CMDP), and observe that by using a Lagrangian relaxation we can decompose it into single-client problems. Further, the optimal policy takes a threshold form in the video buffer length, which enables us to design an efficient constrained reinforcement learning (CRL) algorithm to learn it. Specifically, we show that a natural policy gradient (NPG) based algorithm that is derived using the structure of our problem converges to the globally optimal policy. We then develop a simulation environment for training, and a real-world intelligent controller attached to a WiFi access point for evaluation. We empirically show that the structured learning approach enables fast learning. Furthermore, such a structured policy can be easily deployed due to low computational complexity, leading to policy execution taking only about 15$\mu$s. Using YouTube streaming experiments in a resource constrained scenario, we demonstrate that the CRL approach can increase quality of experience (QOE) by over 30\%.

Comments:	15 pages, 14 figures
Subjects:	Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2404.07315 [eess.SY]
	(or arXiv:2404.07315v2 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2404.07315

Submission history

From: Archana Bura [view email]
[v1] Wed, 10 Apr 2024 19:25:51 UTC (1,021 KB)
[v2] Tue, 16 Apr 2024 22:32:34 UTC (1,027 KB)

Electrical Engineering and Systems Science > Systems and Control

Title:Structured Reinforcement Learning for Media Streaming at the Wireless Edge

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Systems and Control

Title:Structured Reinforcement Learning for Media Streaming at the Wireless Edge

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators