The Most-Watched AI Researcher in the World Chose a Side
Andrej Karpathy announced on May 19 that he had started at Anthropic on the pretraining team. He is one of the most recognizable figures in machine learning — a founding member of OpenAI, the person who ran Tesla’s Autopilot and Full Self-Driving programs, the creator of micrograd and nanoGPT and hours of YouTube tutorials that have taught a generation of engineers how neural networks actually work. He left OpenAI the first time in 2017. He came back in 2023, stayed for one year, left again in 2024 to found Eureka Labs, an AI education startup. And now, without folding Eureka Labs (his posts suggest it continues in some form), he has joined Anthropic’s pretraining team.
The specific role matters. Pretraining is the phase of building a large language model that determines its fundamental capabilities — the massive training run that processes the training data and builds the model’s base knowledge and reasoning capacity. It’s computationally expensive, technically demanding, and strategically central. An Anthropic spokesperson told TechCrunch that Karpathy will build a new team focused on using Claude to accelerate pretraining research itself — the recursive step of applying the model to its own improvement process. That’s not a peripheral research role. That’s Anthropic putting one of the field’s best-known researchers at the core of what makes Claude better at the deepest level.
What This Means for the OpenAI-Anthropic Competition
The talent flow between AI labs is a continuous story, but Karpathy moving to Anthropic is notable on multiple dimensions. He was a founding member of OpenAI — the company Anthropic’s founders left in 2021 after disagreements about safety and commercialization. The founding narrative of Anthropic is that it represents a different approach to AI development than OpenAI: more deliberate, more safety-oriented, more willing to slow down if the safety case requires it. That narrative has been increasingly tested as Anthropic has scaled its commercial ambitions and its models have become competitive with OpenAI’s on capability benchmarks.
Karpathy’s public positioning over the past several years has been carefully non-partisan about labs — he has praised work from OpenAI, Google, and independent researchers equally, and his educational content has been explicitly model-agnostic. The choice to join Anthropic rather than OpenAI (where he could presumably have returned), Google DeepMind (which has courted researchers aggressively), or xAI is a signal that requires reading carefully. He could have gone anywhere. He chose the lab that his former OpenAI colleagues founded after leaving over safety concerns.
That choice doesn’t necessarily say anything definitive about the technical merits of Anthropic’s approach versus OpenAI’s. But it does say something about where Karpathy believes the most interesting pretraining research is happening, or where he believes his specific contributions will be most productive. Karpathy is not someone who takes roles for status or compensation optics. His public record is of someone who moves toward problems he finds genuinely interesting.
Pretraining as the Central Competition
The AI capability race in 2026 has multiple layers: model fine-tuning, deployment infrastructure, agent architecture, multimodal capabilities. But pretraining remains the foundation. The base knowledge, the reasoning patterns, the general capability profile of a model is established in pretraining. Fine-tuning can shape behavior and add specific capabilities, but it cannot substantially alter the base capability ceiling that pretraining set. The labs with the strongest pretraining — the best data curation, the most effective training algorithms, the most efficient use of compute — produce models that are harder to match through post-training optimization alone.
Anthropic’s Claude has been competitive with OpenAI’s GPT and Google’s Gemini on capability benchmarks while maintaining the safety and instruction-following properties that Anthropic has prioritized since its founding. Whether that competitive position is sustainable — whether Anthropic’s pretraining approach can keep pace with the resources OpenAI and Google are deploying — is the strategic question Karpathy is being brought in to help answer.
The specific mandate — using Claude to accelerate pretraining research — is the frontier of what’s called AI-assisted AI development. If Claude can help identify more effective training approaches, better data curation strategies, or more efficient hyperparameter regimes for the next training run, the pace of Anthropic’s model improvement could accelerate faster than the underlying compute expenditure growth would suggest. This is the virtuous cycle that every frontier lab is trying to establish: using the current model to build a better next model faster.
Karpathy’s Educational Role and What It Means for Anthropic
Karpathy’s YouTube channel has over 1.5 million subscribers. His courses on neural networks and language models from scratch have been the primary technical education resource for a generation of engineers who learned ML outside of formal academic programs. His ability to explain complex technical concepts clearly and precisely is as well-documented as his research contributions. This is relevant to Anthropic because one of Anthropic’s stated missions is AI safety research, and safety research requires the broader technical community to understand what frontier models are actually doing at a mechanistic level.
Whether Karpathy continues his educational work while at Anthropic is unclear from the announcement. If he does, Anthropic gains a researcher with a public platform who can communicate what Anthropic is building and why in ways that resonate with the technical community that has been the primary audience for his work. If the educational work pauses, Anthropic still gains one of the field’s strongest pretraining researchers with specific expertise in the training pipeline optimizations that have historically produced significant capability improvements.
Either way, the hire is the kind of signal that changes how the technical community evaluates the labs. Research talent aggregates toward other research talent. The researchers who are deciding where to do their best work look at where the interesting problems are and who they’d be working with. Karpathy’s arrival at Anthropic makes the pretraining team more attractive to the next researcher who’s deciding.
The Eureka Labs Question
Karpathy founded Eureka Labs in 2024 with the mission of applying AI to education — specifically, building AI teaching assistants that could make high-quality education more accessible at scale. The initial product was an AI-native course platform. The project was early and the progress was slower than the initial enthusiasm suggested. Whether Eureka Labs continues as a separate entity, becomes part of Anthropic’s research agenda, or pauses while Karpathy focuses on pretraining work is not entirely clear from the announcement.
The overlap between Anthropic’s mission and Karpathy’s educational interests is genuine. Anthropic has published some of the most important interpretability research in the field — work that tries to understand what’s actually happening inside large language models at a mechanistic level. Karpathy’s ability to translate that kind of research for a broad technical audience is directly relevant to Anthropic’s goal of making its safety research influential beyond its own lab. If the educational mission finds a home inside Anthropic’s research communication strategy, the combination could be more effective than either element separately.
What the Field Is Watching
The specific technical contributions Karpathy makes to Anthropic’s pretraining pipeline will not be publicly visible until they show up in the capabilities of the next Claude version. That could be six months from now or eighteen months from now depending on where the current training run is in its cycle. The signal from the hiring will be interpreted by other researchers as an indicator of where the most interesting pretraining work is happening, regardless of whether the outputs are immediately measurable.
For the broader AI competition, the Karpathy move reinforces a pattern: the talent that defines the field’s direction is not locked to any single institution, and the labs that can attract researchers who have demonstrated both technical excellence and the ability to work productively outside established institutional constraints will have an advantage in the next phase of capability development.
Anthropic got one of those researchers this week. OpenAI, for the second time, watched him leave.
