The Most-Watched AI Researcher in the World Chose a Side
Andrej Karpathy announced on May 19 that he had started at Anthropic on the pretraining team. He is one of the most recognizable figures in machine learning — a founding member of OpenAI, the person who ran Tesla’s Autopilot and Full Self-Driving programs, the creator of micrograd and nanoGPT and hours of YouTube tutorials that have taught a generation of engineers how neural networks actually work. He left OpenAI the first time in 2017. He came back in 2023, stayed for one year, left again in 2024 to found Eureka Labs, an AI education startup. And now, without folding Eureka Labs (his posts suggest it continues in some form), he has joined Anthropic’s pretraining team.
The specific role matters. Pretraining is the phase of building a large language model that determines its fundamental capabilities — the massive training run that processes the training data and builds the model’s base knowledge and reasoning capacity. It’s computationally expensive, technically demanding, and strategically central. An Anthropic spokesperson told TechCrunch that Karpathy will build a new team focused on using Claude to accelerate pretraining research itself — the recursive step of applying the model to its own improvement process. That’s not a peripheral research role. That’s Anthropic putting one of the field’s best-known researchers at the core of what makes Claude better at the deepest level.
What This Means for the OpenAI-Anthropic Competition
The talent flow between AI labs is a continuous story, but Karpathy moving to Anthropic is notable on multiple dimensions. He was a founding member of OpenAI — the company Anthropic’s founders left in 2021 after disagreements about safety and commercialization. The founding narrative of Anthropic is that it represents a different approach to AI development than OpenAI: more deliberate, more safety-oriented, more willing to slow down if the safety case requires it. That narrative has been increasingly tested as Anthropic has scaled its commercial ambitions and its models have become competitive with OpenAI’s on capability benchmarks.
Karpathy’s public positioning over the past several years has been carefully non-partisan about labs — he has praised work from OpenAI, Google, and independent researchers equally, and his educational content has been explicitly model-agnostic. The choice to join Anthropic rather than OpenAI (where he could presumably have returned), Google DeepMind (which has courted researchers aggressively), or xAI is a signal that requires reading carefully. He could have gone anywhere. He chose the lab that his former OpenAI colleagues founded after leaving over safety concerns.
That choice doesn’t necessarily say anything definitive about the technical merits of Anthropic’s approach versus OpenAI’s. But it does say something about where Karpathy believes the most interesting pretraining research is happening, or where he believes his specific contributions will be most productive. Karpathy is not someone who takes roles for status or compensation optics. His public record is of someone who moves toward problems he finds genuinely interesting.
Pretraining as the Central Competition
The AI capability race in 2026 has multiple layers: model fine-tuning, deployment infrastructure, agent architecture, multimodal capabilities. But pretraining remains the foundation. The base knowledge, the reasoning patterns, the general capability profile of a model is established in pretraining. Fine-tuning can shape behavior and add specific capabilities, but it cannot substantially alter the base capability ceiling that pretraining set. The labs with the strongest pretraining — the best data curation, the most effective training algorithms, the most efficient use of compute — produce models that are harder to match through post-training optimization alone.
Anthropic’s Claude has been competitive with OpenAI’s GPT and Google’s Gemini on capability benchmarks while maintaining the safety and instruction-following properties that Anthropic has prioritized since its founding. Whether that competitive position is sustainable — whether Anthropic’s pretraining approach can keep pace with the resources OpenAI and Google are deploying — is the strategic question Karpathy is being brought in to help answer.
The specific mandate — using Claude to accelerate pretraining research — is the frontier of what’s called AI-assisted AI development. If Claude can help identify more effective training approaches, better data curation strategies, or more efficient hyperparameter regimes for the next training run, the pace of Anthropic’s model improvement could accelerate faster than the underlying compute expenditure growth would suggest. This is the virtuous cycle that every frontier lab is trying to establish: using the current model to build a better next model faster.
Karpathy’s Educational Role and What It Means for Anthropic
Karpathy’s YouTube channel has over 1.5 million subscribers. His courses on neural networks and language models from scratch have been the primary technical education resource for a generation of engineers who learned ML outside of formal academic programs. His ability to explain complex technical concepts clearly and precisely is as well-documented as his research contributions. This is relevant to Anthropic because one of Anthropic’s stated missions is AI safety research, and safety research requires the broader technical community to understand what frontier models are actually doing at a mechanistic level.
Whether Karpathy continues his educational work while at Anthropic is unclear from the announcement. If he does, Anthropic gains a researcher with a public platform who can communicate what Anthropic is building and why in ways that resonate with the technical community that has been the primary audience for his work. If the educational work pauses, Anthropic still gains one of the field’s strongest pretraining researchers with specific expertise in the training pipeline optimizations that have historically produced significant capability improvements.
Either way, the hire is the kind of signal that changes how the technical community evaluates the labs. Research talent aggregates toward other research talent. The researchers who are deciding where to do their best work look at where the interesting problems are and who they’d be working with. Karpathy’s arrival at Anthropic makes the pretraining team more attractive to the next researcher who’s deciding.
The Eureka Labs Question
Karpathy founded Eureka Labs in 2024 with the mission of applying AI to education — specifically, building AI teaching assistants that could make high-quality education more accessible at scale. The initial product was an AI-native course platform. The project was early and the progress was slower than the initial enthusiasm suggested. Whether Eureka Labs continues as a separate entity, becomes part of Anthropic’s research agenda, or pauses while Karpathy focuses on pretraining work is not entirely clear from the announcement.
The overlap between Anthropic’s mission and Karpathy’s educational interests is genuine. Anthropic has published some of the most important interpretability research in the field — work that tries to understand what’s actually happening inside large language models at a mechanistic level. Karpathy’s ability to translate that kind of research for a broad technical audience is directly relevant to Anthropic’s goal of making its safety research influential beyond its own lab. If the educational mission finds a home inside Anthropic’s research communication strategy, the combination could be more effective than either element separately.
What the Field Is Watching
The specific technical contributions Karpathy makes to Anthropic’s pretraining pipeline will not be publicly visible until they show up in the capabilities of the next Claude version. That could be six months from now or eighteen months from now depending on where the current training run is in its cycle. The signal from the hiring will be interpreted by other researchers as an indicator of where the most interesting pretraining work is happening, regardless of whether the outputs are immediately measurable.
For the broader AI competition, the Karpathy move reinforces a pattern: the talent that defines the field’s direction is not locked to any single institution, and the labs that can attract researchers who have demonstrated both technical excellence and the ability to work productively outside established institutional constraints will have an advantage in the next phase of capability development.
Anthropic got one of those researchers this week. OpenAI, for the second time, watched him leave.
What The Second Departure Reveals About How AI Talent Actually Works
Karpathy leaving OpenAI once was an individual decision. Karpathy leaving OpenAI and then joining Anthropic is a data point about the structure of the field — and the structure is more fragile than the press cycles around each individual hire make it appear.
The labs have been competing as if the talent market works the way capital markets work: money allocated to the highest return, talent flowing to the highest valuation, market signals clearing efficiently. It doesn’t work that way. The researchers who matter most in this generation of AI development are not optimising for compensation. They are optimising for the quality of the research environment, the clarity of the research direction, and — increasingly — their judgment about which lab’s values and working culture are compatible with sustained high-performance work. Those are not attributes capital can easily buy or replicate.
OpenAI’s problem is not that Karpathy left twice. Its problem is that both departures followed changes to the research environment rather than changes to the compensation structure. The first departure was voluntary exit; the second is a competitive hire by a lab whose research culture Karpathy evidently considers more compatible with the work he wants to do. That distinction matters more than the headline number on either side.
For Anthropic, the hire is evidence of something harder to manufacture than valuation: the research environment is now compelling enough to attract a researcher who had other options and chose based on quality rather than commercial outcome. Anthropic’s commercial acceleration over OpenAI gives the lab the runway to maintain that environment — but the hire is not a commercial story. It is a research-culture story that commercial success is currently making possible. The distinction between the two will matter when the commercial cycle turns.
Karpathy’s Move Is Evidence About Organizational Culture, Not Just Compensation
Glenn Greenwald’s editorial instinct is to read what institutions reveal through behavior rather than what they say about themselves. Karpathy’s departure from OpenAI to Anthropic says publicly that he wants to focus on pretraining research in an environment where that work is central. What it says institutionally is more specific.
OpenAI has now lost two of its most publicly visible technical researchers — Karpathy here, Ilya Sutskever earlier — both moving in the same directional signal: away from the lab that claims to lead the field. Two departures in the same direction is not a coincidence. It is an organizational data point about what OpenAI optimizes for in 2026, which differs from what it optimized for in 2020. The pressure of commercial deployment at scale, the governance disruption of 2023, and the shift toward inference-heavy product development have changed what it means to do foundational research at OpenAI. That change does not make OpenAI worse at building products. It makes it a different environment for researchers who weight pretraining as the central intellectual problem.
Technology reporting at the time of the departure emphasized compensation and opportunity framing. That framing is accurate and incomplete. Karpathy’s public track record — his educational work at Eureka Labs, his stated pretraining focus — is consistent with someone who weights research environment quality heavily. Anthropic’s positioning as a safety-focused lab with pretraining as a core investment is the most coherent explanation for where he landed. The consequences of the move extend beyond the research contribution: Anthropic’s $900 billion commercial trajectory is now reinforced by the strongest signal in the field that its research environment is where serious pretraining work is happening. Whether the signal is more valuable than the work itself is a question the next eighteen months of publication record will answer.

