Thinh T. Doan
Cited by
Cited by
Finite-Time Analysis of Distributed TD(0) with Linear Function Approximation for Multi-Agent Reinforcement Learning
TT Doan, ST Maguluri, J Romberg
International Conference on Machine Learning, 2019
Performance of Q-learning with Linear Function Approximation: Stability and Finite-Time Analysis
Z Chen, S Zhang, TT Doan, ST Maguluri, JP Clarke
arXiv preprint arXiv:1905.11425, 2019
Fast Convergence Rates of Distributed Subgradient Methods with Adaptive Quantization
TT Doan, ST Maguluri, J Romberg
arXiv preprint arXiv:1810.13245, 2018
Convergence of the iterates in mirror descent methods
TT Doan, S Bose, DH Nguyen, CL Beck
IEEE control systems letters 3 (1), 114-119, 2018
Distributed resource allocation on dynamic networks in quadratic time
TT Doan, A Olshevsky, 2015
On the convergence rate of distributed gradient methods for finite-sum optimization under communication delays
TT Doan, CL Beck, R Srikant
arXiv preprint arXiv:1708.03277, 2017
Distributed Lagrangian Methods for Network Resource Allocation
TT Doan, CL Beck
arXiv preprint arXiv:1609.06287, 2016
Finite-time performance of distributed temporal-difference learning with linear function approximation
TT Doan, ST Maguluri, J Romberg
SIAM Journal on Mathematics of Data Science 3 (1), 298-320, 2021
On the Convergence Rate of Distributed Gradient Methods for Finite-Sum Optimization under Communication Delays
TT Doan, CL Beck, R Srikant
Proceedings of the ACM on Measurement and Analysis of Computing Systems 1 (2 …, 2017
Convergence rates of distributed gradient methods under random quantization: A stochastic approximation approach
TT Doan, ST Maguluri, J Romberg
IEEE Transactions on Automatic Control 66 (10), 4469-4484, 2020
On the geometric convergence rate of distributed economic dispatch/demand response in power networks
TT Doan, A Olshevsky
arXiv preprint arXiv:1609.06660, 2016
Finite-time analysis and restarting scheme for linear two-time-scale stochastic approximation
TT Doan
SIAM Journal on Control and Optimization 59 (4), 2798-2819, 2021
Finite sample analysis of two-time-scale natural actor-critic algorithm
S Khodadadian, TT Doan, ST Maguluri, J Romberg
arXiv preprint arXiv:2101.10506, 2021
Distributed resource allocation over dynamic networks with uncertainty
TT Doan, CL Beck
IEEE Transactions on Automatic Control, 2020
Convergence rates of accelerated markov gradient descent with applications in reinforcement learning
TT Doan, LM Nguyen, NH Pham, J Romberg
arXiv preprint arXiv:2002.02873, 2020
A decentralized policy gradient approach to multi-task reinforcement learning
S Zeng, MA Anwar, TT Doan, A Raychowdhury, J Romberg
Uncertainty in Artificial Intelligence, 1002-1012, 2021
Nonlinear two-time-scale stochastic approximation: Convergence and finite-time performance
TT Doan
arXiv preprint arXiv:2011.01868, 2020
Linear two-time-scale stochastic approximation a finite-time analysis
TT Doan, J Romberg
2019 57th Annual Allerton Conference on Communication, Control, and …, 2019
Convergence rate of distributed subgradient methods under communication delays
TT Doan, CL Beck, R Srikant
2018 Annual American Control Conference (ACC), 5310-5315, 2018
Finite-time analysis of stochastic gradient descent under markov randomness
TT Doan, LM Nguyen, NH Pham, J Romberg
arXiv preprint arXiv:2003.10973, 2020
The system can't perform the operation now. Try again later.
Articles 1–20