Gemini: a Family of Highly Capable Multimodal Models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 507 | 2023 |
Acme: A Research Framework for Distributed Reinforcement Learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020 | 235 | 2020 |
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback H Lee, S Phatale, H Mansoor, T Mesnard, J Ferret, K Lu, C Bishop, E Hall, ... International Conference on Machine Learning (ICML 2024), 2023 | 157 | 2023 |
Adversarially Guided Actor-Critic Y Flet-Berliac*, J Ferret*, O Pietquin, P Preux, M Geist International Conference on Learning Representations (ICLR 2021), 2021 | 67 | 2021 |
Gemma: Open Models Based on Gemini Research and Technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024 | 39 | 2024 |
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning J Ferret, R Marinier, M Geist, O Pietquin International Joint Conference on Artificial Intelligence (IJCAI 2020), 2019 | 31 | 2019 |
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback P Roit*, J Ferret*, L Shani*, R Aharoni, G Cideron, R Dadashi, M Geist, ... ACL, 2023 | 28 | 2023 |
Self-Imitation Advantage Learning J Ferret, O Pietquin, M Geist International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020 | 23 | 2020 |
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning N Grinsztajn*, J Ferret*, O Pietquin, P Preux, M Geist Advances in Neural Information Processing Systems (NeurIPS 2021), 2021 | 21 | 2021 |
Lazy-MDPs: Towards Interpretable Reinforcement Learning By Learning When To Act A Jacq*, J Ferret*, O Pietquin, M Geist International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2022 | 19* | 2022 |
WARM: On the Benefits of Weight Averaged Reward Models A Ramé, N Vieillard, L Hussenot, R Dadashi, G Cideron, O Bachem, ... International Conference on Machine Learning (ICML 2024), 2024 | 9 | 2024 |
Direct Language Model Alignment from Online AI Feedback S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ... arXiv preprint arXiv:2402.04792, 2024 | 6 | 2024 |
Credit assignment as a proxy for transfer in reinforcement learning J Ferret, R Marinier, M Geist, O Pietquin Learning Transferrable Skills Workshop, NeurIPS, 2019 | 6 | 2019 |
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning E Pignatelli, J Ferret, M Geist, T Mesnard, H van Hasselt, L Toni Transactions on Machine Learning Research (TMLR), 2023 | 2 | 2023 |
More efficient exploration with symbolic priors on action sequence equivalences T Johnstone, N Grinsztajn, J Ferret, P Preux Deep Reinforcement Learning Workshop, NeurIPS, 2022 | 2* | 2022 |
On actions that matter: Credit assignment and interpretability in reinforcement learning J Ferret Université de Lille, 2022 | 2 | 2022 |
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ... arXiv preprint arXiv:2404.07839, 2024 | | 2024 |
Offline Credit Assignment in Deep Reinforcement Learning with Hindsight Discriminator Networks J Ferret, O Pietquin, M Geist EWRL, 2022 | | 2022 |