Lists / People

Anthropic Interpretability Team

circuits, sparse autoencoder, monosemanticity reports

Anthropic's mechanistic interpretability research group, known for the circuits thread, monosemanticity findings, and sparse autoencoder papers on neural network internals.

@AnthropicAI Wikipedia

#37AI Researchers Who Communicate

55/ 100

Authority scoreEstablished

1lists#37peak1primary1category

Top 100 AI Researchers Who Communicate

Rank #37·circuits, sparse autoencoder, monosemanticity reports

Appears alongside

Top neighbors of Anthropic Interpretability Team

8people

Aidan Gomez

Cohere co-founder; transformer-builder commentary

1shared list

Ajeya Cotra

biological-anchors timelines report

1shared list

Alec Radford

GPT-1/2/CLIP communications

1shared list

Aleksander Madry

adversarial-robustness framework

1shared list

Alex Smola

"Dive into Deep Learning" book

1shared list

Ali Rahimi

"Machine learning is alchemy" NeurIPS test-of-time talk

1shared list

Allen Downey

"Think Bayes", "Think Stats"

1shared list

Anca Dragan

assistive-AI / human-robot interaction lectures

1shared list

Last updated

10 May 2026

Suggest an edit →