Łukasz Kaiser

Cited by

	All	Since 2019
Citations	177228	166045
h-index	52	49
i10-index	85	71

54000

27000

13500

40500

201620172018201920202021202220232024824 2955 6302 11263 18019 27419 37999 53306 18004

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Jakob UszkoreitInceptiveVerified email at uszkoreit.net
Noam ShazeerCharacter.aiVerified email at character.ai
Ashish VaswaniStartupVerified email at fastmail.com
Aidan GomezCohereVerified email at cohere.ai
Illia PolosukhinNEAR.AIVerified email at near.ai
Afroz MohiuddinGoogle IncVerified email at google.com
Oriol VinyalsResearch Scientist at Google DeepMindVerified email at google.com
Samy BengioSenior Director, AI and Machine Learning Research, AppleVerified email at apple.com
Ilya SutskeverCo-Founder and Chief Scientist of OpenAIVerified email at openai.com
Henryk MichalewskiGoogleVerified email at google.com
Anselm LevskayaResearch Scientist, GoogleVerified email at google.com
Stephan GouwsSenior Research Scientist, Google DeepMindVerified email at google.com
George TuckerGoogle BrainVerified email at google.com
Quoc V. LeResearch Scientist, GoogleVerified email at stanford.edu
François CholletGoogleVerified email at google.com
Ben D GoodrichGoogleVerified email at google.com
Piotr KozakowskiUniversity of WarsawVerified email at mimuw.edu.pl
Geoffrey HintonEmeritus Prof. Computer Science, University of TorontoVerified email at cs.toronto.edu
Mohammad SalehGoogle BrainVerified email at google.com
Étienne PotGoogleVerified email at epfl.ch

Łukasz Kaiser

OpenAI & CNRS

Verified email at openai.com - Homepage

Machine Learning & Logic in Computer Science


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Attention is all you need A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... Advances in neural information processing systems 30, 2017	116698	2017
TensorFlow: Large-scale machine learning on heterogeneous systems M Abadi, A Agarwal, P Barham, E Brevdo, Z Chen, C Citro, GS Corrado, ...	24170*	2015
Google's neural machine translation system: Bridging the gap between human and machine translation Y Wu, M Schuster, Z Chen, QV Le, M Norouzi, W Macherey, M Krikun, ... arXiv preprint arXiv:1609.08144, 2016	8359	2016
Attention is all you need A Waswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, A Gomez, ... NIPS, 2017	3544*	2017
Reformer: The efficient transformer N Kitaev, Ł Kaiser, A Levskaya arXiv preprint arXiv:2001.04451, 2020	2225	2020
Evaluating large language models trained on code M Chen, J Tworek, H Jun, Q Yuan, HPO Pinto, J Kaplan, H Edwards, ... arXiv preprint arXiv:2107.03374, 2021	1935	2021
Image transformer N Parmar, A Vaswani, J Uszkoreit, L Kaiser, N Shazeer, A Ku, D Tran International conference on machine learning, 4055-4064, 2018	1797	2018
Rethinking attention with performers K Choromanski, V Likhosherstov, D Dohan, X Song, A Gane, T Sarlos, ... arXiv preprint arXiv:2009.14794, 2020	1262	2020
Regularizing neural networks by penalizing confident output distributions G Pereyra, G Tucker, J Chorowski, Ł Kaiser, G Hinton arXiv preprint arXiv:1701.06548, 2017	1186	2017
Grammar as a foreign language O Vinyals, Ł Kaiser, T Koo, S Petrov, I Sutskever, G Hinton Advances in neural information processing systems 28, 2015	1109	2015
Training verifiers to solve math word problems K Cobbe, V Kosaraju, M Bavarian, M Chen, H Jun, L Kaiser, M Plappert, ... arXiv preprint arXiv:2110.14168, 2021	1050	2021
Attention is all you need. arXiv 2017 A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ... arXiv preprint arXiv:1706.03762 3762, 2023	1025	2023
Multi-task sequence to sequence learning MT Luong, QV Le, I Sutskever, O Vinyals, L Kaiser arXiv preprint arXiv:1511.06114, 2015	926	2015
Generating wikipedia by summarizing long sequences PJ Liu, M Saleh, E Pot, B Goodrich, R Sepassi, L Kaiser, N Shazeer arXiv preprint arXiv:1801.10198, 2018	906	2018
Universal transformers M Dehghani, S Gouws, O Vinyals, J Uszkoreit, Ł Kaiser arXiv preprint arXiv:1807.03819, 2018	887	2018
Model-based reinforcement learning for atari L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ... arXiv preprint arXiv:1903.00374, 2019	874	2019
TensorFlow: Large-scale machine learning on heterogeneous systems, software available from tensorflow. org (2015) M Abadi, A Agarwal, P Barham, E Brevdo, Z Chen, C Citro, GS Corrado, ... URL https://www. tensorflow. org, 2015	833	2015
Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023	683	2023
Tensor2tensor for neural machine translation A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ... arXiv preprint arXiv:1803.07416, 2018	609	2018
Adding gradient noise improves learning for very deep networks A Neelakantan, L Vilnis, QV Le, I Sutskever, L Kaiser, K Kurach, J Martens arXiv preprint arXiv:1511.06807, 2015	574	2015

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors