Johan Ferret

Alıntı yapanlar

	Hepsi	2019 yılından bugüne
Alıntılar	1154	1153
h-endeksi	10	10
i10-endeksi	10	10

700

350

175

525

2020202120222023202424 85 134 208 696

Katkıda bulunan yazarlar

Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)univ-lorraine.fr üzerinde doğrulanmış e-posta adresine sahip
Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)univ-lille.fr üzerinde doğrulanmış e-posta adresine sahip
Thomas MesnardResearch Scientist at Google DeepMindgoogle.com üzerinde doğrulanmış e-posta adresine sahip
Philippe PreuxProfessor of computer science, Université de Lille, LIFL, SequeL, INRIAuniv-lille.fr üzerinde doğrulanmış e-posta adresine sahip
Harrison LeeGoogle Researchgoogle.com üzerinde doğrulanmış e-posta adresine sahip
Samrat PhataleGoogle Researchgoogle.com üzerinde doğrulanmış e-posta adresine sahip
Raphaël MarinierGoogle AIgoogle.com üzerinde doğrulanmış e-posta adresine sahip
Nathan GrinsztajnInriainria.fr üzerinde doğrulanmış e-posta adresine sahip
Nino VieillardGoogle DeepMindgoogle.com üzerinde doğrulanmış e-posta adresine sahip
Léonard HussenotGoogle DeepMindgoogle.com üzerinde doğrulanmış e-posta adresine sahip
Olivier BachemResearch Scientist, Google Braingoogle.com üzerinde doğrulanmış e-posta adresine sahip
Robert DadashiGoogle DeepMindgoogle.com üzerinde doğrulanmış e-posta adresine sahip
Geoffrey CideronGoogle DeepMindgoogle.com üzerinde doğrulanmış e-posta adresine sahip
Yannis Flet-BerliacPostdoc, Stanford Universitystanford.edu üzerinde doğrulanmış e-posta adresine sahip
Alexis D. JacqGooglegoogle.com üzerinde doğrulanmış e-posta adresine sahip
Ramé AlexandreGoogle DeepMindgoogle.com üzerinde doğrulanmış e-posta adresine sahip
Roee AharoniGoogle Researchgoogle.com üzerinde doğrulanmış e-posta adresine sahip
Sabela RamosSoftware Engineer. Google.google.com üzerinde doğrulanmış e-posta adresine sahip
Mathieu BlondelGooglegoogle.com üzerinde doğrulanmış e-posta adresine sahip
Eduardo PignatelliUniversity College Londonucl.ac.uk üzerinde doğrulanmış e-posta adresine sahip

Takip et

Johan Ferret

Research Scientist, Google DeepMind

google.com üzerinde doğrulanmış e-posta adresine sahip - Ana Sayfa

Reinforcement Learning Machine Learning Artificial Intelligence


Başlık Alıntılara göre sırala Yıla göre sırala Başlığa göre sırala	Alıntı yapanlar Alıntı yapanlar	Yıl
Gemini: a Family of Highly Capable Multimodal Models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	507	2023
Acme: A Research Framework for Distributed Reinforcement Learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	235	2020
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback H Lee, S Phatale, H Mansoor, T Mesnard, J Ferret, K Lu, C Bishop, E Hall, ... International Conference on Machine Learning (ICML 2024), 2023	157	2023
Adversarially Guided Actor-Critic Y Flet-Berliac, J Ferret, O Pietquin, P Preux, M Geist International Conference on Learning Representations (ICLR 2021), 2021	67	2021
Gemma: Open Models Based on Gemini Research and Technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	39	2024
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning J Ferret, R Marinier, M Geist, O Pietquin International Joint Conference on Artificial Intelligence (IJCAI 2020), 2019	31	2019
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback P Roit, J Ferret, L Shani*, R Aharoni, G Cideron, R Dadashi, M Geist, ... ACL, 2023	28	2023
Self-Imitation Advantage Learning J Ferret, O Pietquin, M Geist International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020	23	2020
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning N Grinsztajn, J Ferret, O Pietquin, P Preux, M Geist Advances in Neural Information Processing Systems (NeurIPS 2021), 2021	21	2021
Lazy-MDPs: Towards Interpretable Reinforcement Learning By Learning When To Act A Jacq, J Ferret, O Pietquin, M Geist International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2022	19*	2022
WARM: On the Benefits of Weight Averaged Reward Models A Ramé, N Vieillard, L Hussenot, R Dadashi, G Cideron, O Bachem, ... International Conference on Machine Learning (ICML 2024), 2024	9	2024
Direct Language Model Alignment from Online AI Feedback S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ... arXiv preprint arXiv:2402.04792, 2024	6	2024
Credit assignment as a proxy for transfer in reinforcement learning J Ferret, R Marinier, M Geist, O Pietquin Learning Transferrable Skills Workshop, NeurIPS, 2019	6	2019
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning E Pignatelli, J Ferret, M Geist, T Mesnard, H van Hasselt, L Toni Transactions on Machine Learning Research (TMLR), 2023	2	2023
More efficient exploration with symbolic priors on action sequence equivalences T Johnstone, N Grinsztajn, J Ferret, P Preux Deep Reinforcement Learning Workshop, NeurIPS, 2022	2*	2022
On actions that matter: Credit assignment and interpretability in reinforcement learning J Ferret Université de Lille, 2022	2	2022
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ... arXiv preprint arXiv:2404.07839, 2024		2024
Offline Credit Assignment in Deep Reinforcement Learning with Hindsight Discriminator Networks J Ferret, O Pietquin, M Geist EWRL, 2022		2022

Sistem, işlemi şu anda gerçekleştiremiyor. Daha sonra yeniden deneyin.

Makaleler 1–18

Yıllık alıntı sayısı

Mükerrer alıntılar

Birleştirilmiş alıntılar

Katkıda bulunan yazar ekleKatkıda bulunan yazarlar

Takip et

Alıntı yapanlar

Katkıda bulunan yazarlar