Exploration through reward biasing: Reward-biased maximum likelihood estimation for stochastic multi-armed bandits. X Liu, PC Hsieh, YH Hung, A Bhattacharya, P Kumar. International Conference on Machine Learning, 6248-6258, 2020 | 12 | 2020 |
Reward-biased maximum likelihood estimation for linear stochastic bandits. YH Hung, PC Hsieh, X Liu, PR Kumar. Proceedings of the AAAI Conference on Artificial Intelligence 35 (9), 7874-7882, 2021 | 10 | 2021 |
Reward-biased maximum likelihood estimation for neural contextual bandits: a distributional learning perspective. YH Hung, PC Hsieh. Proceedings of the AAAI Conference on Artificial Intelligence 37 (7), 7944-7952, 2023 | 2 | 2023 |
Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs. YH Hung, PC Hsieh, A Mete, PR Kumar. arXiv preprint arXiv:2310.11515, 2023 | | 2023 |
Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits. YH Hung, PC Hsieh. arXiv preprint arXiv:2203.04192, 2022 | | 2022 |
Neural Contextual Bandits via Reward-Biased Maximum Likelihood Estimation. YH Hung, PC Hsieh. CoRR, 2022 | | 2022 |