Natural Actor-Critic Converges Globally for Hierarchical Linear Quadratic Regulator

Luo, Yuwei; Yang, Zhuoran; Wang, Zhaoran; Kolar, Mladen

arXiv:1912.06875 (cs)

[Submitted on 14 Dec 2019 (v1), last revised 24 Dec 2021 (this version, v2)]

Abstract: Multi-agent reinforcement learning has been successfully applied to a number of challenging problems. Despite these empirical successes, theoretical understanding of different algorithms is lacking, primarily due to the curse of dimensionality caused by the exponential growth of the state-action space with the number of agents. We study a fundamental problem of multi-agent linear quadratic regulator (LQR) in a setting where the agents are partially exchangeable. In this setting, we develop a hierarchical actor-critic algorithm, whose computational complexity is independent of the total number of agents, and prove its global linear convergence to the optimal policy. As LQRs are often used to approximate general dynamic systems, this paper provides an important step towards a better understanding of general hierarchical mean-field multi-agent reinforcement learning.
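
The abstract does not spell out the algorithm, so the following is only a point of reference: a minimal sketch of a natural policy gradient iteration on a standard single-agent, discrete-time LQR with a known model. It is not the paper's hierarchical actor-critic method, which is multi-agent and relies on a learned critic. The system matrices, step size, iteration counts, and function names are all illustrative assumptions. The sketch uses the policy u = -Kx and the commonly used update K <- K - 2*eta*E_K, where E_K = (R + B'P_K B)K - B'P_K A, and compares the result with the gain obtained from Riccati value iteration.

```python
import numpy as np

def cost_matrix(A, B, Q, R, K, iters=500):
    """Return P_K solving P = Q + K'RK + (A-BK)' P (A-BK).

    Plain fixed-point iteration; it converges only if A - BK is stable.
    """
    A_cl = A - B @ K
    base = Q + K.T @ R @ K
    P = np.zeros_like(Q)
    for _ in range(iters):
        P = base + A_cl.T @ P @ A_cl
    return P

def natural_pg(A, B, Q, R, K0, eta, steps):
    """Model-based natural policy gradient for the policy u = -K x."""
    K = K0.copy()
    for _ in range(steps):
        P = cost_matrix(A, B, Q, R, K)
        E = (R + B.T @ P @ B) @ K - B.T @ P @ A  # natural-gradient direction E_K
        K = K - 2.0 * eta * E
    return K

def riccati_gain(A, B, Q, R, iters=500):
    """Reference optimal gain from Riccati value iteration."""
    P = Q.copy()
    for _ in range(iters):
        G = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ G)
    return np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)

# Hypothetical 2-state, 1-input system (assumed values, not from the paper).
A = np.array([[0.9, 0.2], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])

K0 = np.zeros((1, 2))  # A itself is stable, so K = 0 is a stabilizing start
K = natural_pg(A, B, Q, R, K0, eta=0.05, steps=200)
K_star = riccati_gain(A, B, Q, R)
print("max |K - K*| =", np.max(np.abs(K - K_star)))
```

The step size here is small enough that each update lowers the cost for this particular system; a different system would generally need a different step size and a stabilizing initial gain.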

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1912.06875 [cs.LG]
	(or arXiv:1912.06875v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1912.06875

Submission history

From: Yuwei Luo
[v1] Sat, 14 Dec 2019 16:26:42 UTC (383 KB)
[v2] Fri, 24 Dec 2021 18:50:59 UTC (573 KB)
