Takip et
Romain Laroche
Romain Laroche
Microsoft Research
polytechnique.org üzerinde doğrulanmış e-posta adresine sahip - Ana Sayfa
Başlık
Alıntı yapanlar
Alıntı yapanlar
Yıl
Hybrid reward architecture for reinforcement learning
H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang
Advances in Neural Information Processing Systems 30, 2017
2502017
Safe policy improvement with baseline bootstrapping
R Laroche, P Trichelair, RT Des Combes
International conference on machine learning, 3652-3661, 2019
2122019
Learning dynamic belief graphs to generalize on text-based games
A Adhikari, X Yuan, MA Côté, M Zelinka, MA Rondeau, R Laroche, ...
Advances in Neural Information Processing Systems 33, 3045-3057, 2020
1002020
Contextual bandit for active learning: Active thompson sampling
D Bouneffouf, R Laroche, T Urvoy, R Féraud, R Allesiardo
Neural Information Processing: 21st International Conference, ICONIP 2014 …, 2014
932014
Counting to explore and generalize in text-based games
X Yuan, MA Côté, A Sordoni, R Laroche, RT Combes, M Hausknecht, ...
arXiv preprint arXiv:1806.11525, 2018
592018
Transfer reinforcement learning with shared dynamics
R Laroche, M Barlier
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
592017
When does return-conditioned supervised learning work for offline reinforcement learning?
D Brandfonbrener, A Bietti, J Buckman, R Laroche, J Bruna
Advances in Neural Information Processing Systems 35, 1542-1553, 2022
522022
Score-based inverse reinforcement learning
L El Asri, B Piot, M Geist, R Laroche, O Pietquin
International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2016
452016
Reinforcement learning algorithm selection
R Laroche, R Feraud
ICLR, 2018
392018
Hybrid reward architecture for reinforcement learning
HH Van Seijen, SMF Booshehri, RMH Laroche, JS Romoff
US Patent 10,977,551, 2021
382021
Safe policy improvement with soft baseline bootstrapping
K Nadjahi, R Laroche, R Tachet des Combes
Machine Learning and Knowledge Discovery in Databases: European Conference …, 2020
342020
Transfer Learning for User Adaptation in Spoken Dialogue Systems.
A Genevay, R Laroche
AAMAS, 975-983, 2016
332016
Human-machine dialogue as a stochastic game
M Barlier, J Perolat, R Laroche, O Pietquin
16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015
312015
NASTIA: Negotiating Appointment Setting Interface.
L El Asri, R Lemonnier, R Laroche, O Pietquin, H Khouzaimi
LREC, 266-271, 2014
302014
Reward function learning for dialogue management
L El Asri, R Laroche, O Pietquin
STAIRS 2012, 95-106, 2012
292012
Reward shaping for statistical optimisation of dialogue management
L El Asri, R Laroche, O Pietquin
Statistical Language and Speech Processing: First International Conference …, 2013
282013
Decentralized exploration in multi-armed bandits
R Féraud, R Alami, R Laroche
International Conference on Machine Learning, 1901-1909, 2019
262019
On value function representation of long horizon problems
L Lehnert, R Laroche, H van Seijen
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
252018
Optimising turn-taking strategies with reinforcement learning
H Khouzaimi, R Laroche, F Lefevre
Proceedings of the 16th Annual Meeting of the Special Interest Group on …, 2015
252015
Hybridisation of expertise and reinforcement learning in dialogue systems
R Laroche, G Putois, P Bretier, B Bouchon-Meunier
Tenth Annual Conference of the International Speech Communication Association, 2009
252009
Sistem, işlemi şu anda gerçekleştiremiyor. Daha sonra yeniden deneyin.
Makaleler 1–20