Edward Hughes
Edward Hughes
Staff Research Engineer, DeepMind
google.com üzerinde doğrulanmış e-posta adresine sahip - Ana Sayfa
Başlık
Alıntı yapanlar
Alıntı yapanlar
Yıl
Social influence as intrinsic motivation for multi-agent deep reinforcement learning
N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ...
International Conference on Machine Learning, 3040-3049, 2019
172*2019
The hanabi challenge: A new frontier for ai research
N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ...
Artificial Intelligence 280, 103216, 2020
1282020
Inequity aversion improves cooperation in intertemporal social dilemmas
E Hughes, JZ Leibo, MG Phillips, K Tuyls, EA Duéñez-Guzmán, ...
arXiv preprint arXiv:1803.08884, 2018
972018
Learning to follow language instructions with adversarial reward induction
D Bahdanau, F Hill, J Leike, E Hughes, P Kohli, E Grefenstette
arXiv preprint arXiv:1806.01946, 2018
90*2018
Bayesian action decoder for deep multi-agent reinforcement learning
J Foerster, F Song, E Hughes, N Burch, I Dunning, S Whiteson, ...
International Conference on Machine Learning, 1942-1951, 2019
792019
OpenSpiel: A framework for reinforcement learning in games
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint arXiv:1908.09453, 2019
622019
Causal reasoning from meta-reinforcement learning
I Dasgupta, J Wang, S Chiappa, J Mitrovic, P Ortega, D Raposo, ...
arXiv preprint arXiv:1901.08162, 2019
512019
Autocurricula and the emergence of innovation from social interaction: A manifesto for multi-agent intelligence research
JZ Leibo, E Hughes, M Lanctot, T Graepel
arXiv preprint arXiv:1903.00742, 2019
442019
Evolving intrinsic motivations for altruistic behavior
JX Wang, E Hughes, C Fernando, WM Czarnecki, EA Duéñez-Guzmán, ...
arXiv preprint arXiv:1811.05931, 2018
362018
A generalized training approach for multiagent learning
P Muller, S Omidshafiei, M Rowland, K Tuyls, J Perolat, S Liu, D Hennes, ...
arXiv preprint arXiv:1909.12823, 2019
302019
The connected prescription for form factors in twistor space
A Brandhuber, E Hughes, R Panerai, B Spence, G Travaglini
Journal of High Energy Physics 2016 (11), 1-17, 2016
262016
Search for a Structure in the Invariant Mass Spectrum with the ATLAS Experiment
M Aaboud, G Aad, B Abbott, B Abeloos, SH Abidi, OS AbouZeid, ...
Physical review letters 120 (20), 202007, 2018
232018
Malthusian reinforcement learning
JZ Leibo, J Perolat, E Hughes, S Wheelwright, AH Marblestone, ...
arXiv preprint arXiv:1812.07019, 2018
202018
Open problems in cooperative AI
A Dafoe, E Hughes, Y Bachrach, T Collins, KR McKee, JZ Leibo, K Larson, ...
arXiv preprint arXiv:2012.08630, 2020
192020
Learning reciprocity in complex sequential social dilemmas
T Eccles, E Hughes, J Kramár, S Wheelwright, JZ Leibo
AAMAS, 1934-1936, 2019
18*2019
Social diversity and social preferences in mixed-motive reinforcement learning
KR McKee, I Gemp, B McWilliams, EA Duéñez-Guzmán, E Hughes, ...
arXiv preprint arXiv:2002.02325, 2020
172020
One-loop soft theorems via dual superconformal symmetry
A Brandhuber, E Hughes, B Spence, G Travaglini
Journal of High Energy Physics 2016 (3), 1-44, 2016
142016
Top-quark mass measurement in the all-hadronic t t ¯ decay channel at s = 8 TeV …
M Aaboud, G Aad, B Abbott, J Abdallah, B Abeloos, R Aben, OS AbouZeid, ...
Journal of High Energy Physics 2017 (9), 1-41, 2017
122017
Bounds and dynamics for empirical game theoretic analysis
K Tuyls, J Perolat, M Lanctot, E Hughes, R Everett, JZ Leibo, ...
Autonomous Agents and Multi-Agent Systems 34 (1), 1-30, 2020
112020
Smooth markets: A basic mechanism for organizing gradient-based learners
D Balduzzi, WM Czarnecki, TW Anthony, IM Gemp, E Hughes, JZ Leibo, ...
arXiv preprint arXiv:2001.04678, 2020
92020
Sistem, işlemi şu anda gerçekleştiremiyor. Daha sonra yeniden deneyin.
Makaleler 1–20