Top 100 AI Researchers Who Communicate
102people
001002003004005006007008009010011012013014015016017018019020021022023024025026027028029030031032033034035036037038039040041042043044045046047048049050051052053054055056057058059060061062063064065066067068069070071072073074075076077078079080081082083084085086087088089090091092093094095096097098099100101102
Andrej Karpathy
"Software 2.0", neural net zero-to-hero lecture series, nanoGPT
Ian Goodfellow
GANs; co-author of the Deep Learning textbook
Richard Sutton
"The Bitter Lesson"; reinforcement learning textbook
Yoshua Bengio
deep learning foundations; AI risk essays
Demis Hassabis
DeepMind founder; AlphaGo/AlphaFold communication
Dario Amodei
"Machines of Loving Grace"; Anthropic scaling essays
Ilya Sutskever
scaling hypothesis advocacy; seminal lectures on representation
Yann LeCun
convolutional nets; JEPA / world-model essays
Chris Olah
Distill.pub founder; mechanistic interpretability primers
Geoffrey Hinton
backprop, capsules; public AI-risk communication
Andrew Ng
Coursera ML course; "AI is the new electricity"
Fei-Fei Li
ImageNet; "The Worlds I See"; human-centered AI framing
Sebastian Raschka
"Build a Large Language Model from Scratch"
Lilian Weng
long-form ML blog (lilianweng.github.io)
Jay Alammar
"Illustrated Transformer", "Illustrated GPT-2"
Sasha Rush
Annotated Transformer; minGPT-style pedagogy
Jeremy Howard
fast.ai courses; ULMFiT
Rachel Thomas
fast.ai co-founder; AI ethics essays
Stuart Russell
"Human Compatible"; provably beneficial AI
Pedro Domingos
"The Master Algorithm"; five tribes framework
Michael Nielsen
"Neural Networks and Deep Learning" online book
David MacKay
"Information Theory, Inference, and Learning Algorithms"
Christopher Bishop
"Pattern Recognition and Machine Learning"
Kevin Murphy
"Probabilistic Machine Learning" textbooks
Jurgen Schmidhuber
LSTM; deep history of deep learning essays
Pieter Abbeel
Berkeley RL lectures; Covariant founder essays
Sergey Levine
Berkeley deep RL course; offline RL writing
Chelsea Finn
meta-learning (MAML); Stanford CS330 lectures
Percy Liang
HELM benchmark; foundation-model framing
Christopher Manning
Stanford NLP course; CS224n
Dan Jurafsky
"Speech and Language Processing" textbook
Emily Bender
"Stochastic Parrots"; octopus-test framing
Timnit Gebru
algorithmic-bias research; DAIR Institute
Margaret Mitchell
Model Cards framework
Arvind Narayanan
"AI Snake Oil"; CS-fairness writing
Sayash Kapoor
"AI Snake Oil" co-author
Anthropic Interpretability Team
circuits, sparse autoencoder, monosemanticity reports
Neel Nanda
mechanistic interpretability tutorials and TransformerLens
Jacob Steinhardt
emergent capabilities essays; Berkeley alignment
Paul Christiano
RLHF; alignment-research essays
Jan Leike
superalignment writing; OpenAI/Anthropic
Holden Karnofsky
"Most important century" series
Ajeya Cotra
biological-anchors timelines report
Eliezer Yudkowsky
"Sequences"; AI-risk arguments
Gwern Branwen
long-form scaling-law and tooling essays
Nathan Lambert
Interconnects; RLHF book
Sebastian Ruder
NLP newsletter; transfer-learning surveys
Yannic Kilcher
paper-explained YouTube channel
Andrew Trask
"Grokking Deep Learning"; OpenMined
François Chollet
Keras; ARC benchmark; "On the Measure of Intelligence"
Aurélien Géron
"Hands-On Machine Learning with Scikit-Learn & TensorFlow"
Charles Isbell
Georgia Tech ML lectures; community essays
Michael Littman
RL lectures; "code that runs other code" framing
Tom Mitchell
canonical ML textbook; "well-posed learning problem"
Daphne Koller
probabilistic graphical models textbook
Judea Pearl
causal inference; "The Book of Why"
Cynthia Rudin
interpretable-ML manifesto
Been Kim
TCAV; concept-based interpretability
Finale Doshi-Velez
interpretable-ML rigor framework
Zachary Lipton
"Troubling Trends in ML Scholarship"
Ali Rahimi
"Machine learning is alchemy" NeurIPS test-of-time talk
Alex Smola
"Dive into Deep Learning" book
Mu Li
D2L co-author; distributed-training essays
Tim Dettmers
bitsandbytes; GPU/LLM hardware blog
Hugging Face team (Thomas Wolf et al.)
Transformers library tutorials and ecosystem essays
Lewis Tunstall
"NLP with Transformers" book
Leandro von Werra
TRL library; RLHF tutorials
Sebastian Bubeck
"Sparks of AGI" report; theory blog
Yoav Goldberg
NLP textbook; LLM critique threads
Hadley Wickham (adjacent)
tidymodels; data-science framework writing
Allen Downey
"Think Bayes", "Think Stats"
Cassie Kozyrkov
decision-intelligence framework essays
Chip Huyen
"Designing Machine Learning Systems"; LLM-eval blog
Eugene Yan
applied ML at scale blog
Vicki Boykis
embeddings primer; "What are embeddings?"
Hamel Husain
LLM eval and fine-tuning playbooks
Jeremy Jordan
ML pedagogy blog
Christopher Potts
Stanford NLU; benchmark-design essays
Stephen Wolfram
"What Is ChatGPT Doing…" essay
Rich Caruana
multi-task learning; intelligible models
Jeff Dean
"Pathways" essay; Google AI architecture writing
Oriol Vinyals
seq2seq; AlphaStar communications
David Silver
RL course at UCL; AlphaGo lectures
Joelle Pineau
reproducibility-in-RL framework; Meta FAIR
Jakob Foerster
multi-agent RL primers
Noam Shazeer
Mixture-of-Experts, multi-query attention papers (and explainers)
Aidan Gomez
Cohere co-founder; transformer-builder commentary
Jared Kaplan
scaling-laws paper
Tom Brown
GPT-3 paper lead; few-shot framing
Alec Radford
GPT-1/2/CLIP communications
John Schulman
PPO; RLHF lecture notes
David Ha
World Models paper; Sakana essays
Yi Tay
efficient-Transformers survey; long-context blog
Tri Dao
FlashAttention papers and explainers
Aleksander Madry
adversarial-robustness framework
Dawn Song
secure ML and decentralized-AI writing
Anca Dragan
assistive-AI / human-robot interaction lectures
Stuart Geman
"stochastic relaxation" foundations; pedagogical writing
Michael I. Jordan
graphical models; "AI revolution hasn't happened yet" essay
Sara Hooker
Cohere For AI; "The Hardware Lottery" essay
Noam Brown
OpenAI researcher and lead on o1 (inference-time scaling / chain-of-thought reasoning). Co-created Libratus (2017 poker AI). Regularly public-speaking on the case that inference-time compute is a new scaling axis.
Jim Fan
NVIDIA Director of Robotics; created Voyager (open-ended LLM agent) and coined the "Physical Turing Test". Co-leads GEAR Lab + Project GR00T on embodied / physical AI.