Harvard Physicist Uses Claude to Produce Frontier Physics Paper in Two Weeks

Research1 source·Mar 24

anthropic claude science autoresearch ai-progress agents

Summary

• Harvard physics professor produced a frontier paper in two weeks using Claude Opus 4.5.
• Project required 110+ drafts, 36M tokens, and 40+ hours of local CPU compute.
• Claude was fast and capable but sloppy enough that domain expertise was essential.
• Author calls it a turning point in AI-assisted science — 'there is no going back.'

Adjust signal

Details

#	Type	Key Point	Context
1	Research	Frontier physics paper produced in two weeks with Claude Opus 4.5	Schwartz describes the result as a technically rigorous, impactful high-energy theoretical physics paper. The process involved 110 separate drafts and 36 million tokens, compressed into roughly two weeks versus the typical timeline of about a year.
2	Stat	110 drafts, 36M tokens, 40+ hours of local CPU compute	These figures give a concrete sense of the iteration volume and computational cost involved in using Claude as a research collaborator on a single physics paper — intensive and iterative, not a simple prompt-and-publish pipeline.
3	Insight	Domain expertise was essential — Claude was fast but sloppy	Schwartz stresses that while Claude was fast and eager to please, it made enough errors that a domain expert was necessary to evaluate its outputs. He explicitly states AI is not doing end-to-end science yet, and the human's role shifted from doing the work to supervising and validating it.
4	Insight	Author calls this the most important paper he has written — for the method, not the physics	Schwartz frames the significance not in the scientific findings but in demonstrating a repeatable method for using LLMs to contribute to frontier science — something he says was not possible three months earlier. He concludes 'there is no going back.'
5	Context	Existing AI scientist projects criticized for selecting best result of many automated trials	Schwartz references Sakana AI's AI Scientist (August 2024), Google's AI co-scientist on Gemini (February 2025), and AI2's Asta ecosystem including CodeScientist and AutoDiscovery (August 2025), arguing their successes seem forced rather than genuinely autonomous.
6	Industry Update	AI has had more documented success in mathematics than in theoretical physics	DeepMind's FunSearch (2023) and AlphaEvolve made new discoveries in combinatorics. AlphaProof earned a silver medal at the 2024 International Mathematical Olympiad, and in 2025 an advanced version of Gemini achieved the gold-medal standard at the IMO.
7	Insight	Author suggests LLMs may need an intermediate phase before fully autonomous science	Schwartz argues the field should not skip intermediate steps on the path to end-to-end AI science, using the analogy that LLMs may need to go through a graduate school phase before reaching full scientific autonomy.

1.Research

Frontier physics paper produced in two weeks with Claude Opus 4.5

Schwartz describes the result as a technically rigorous, impactful high-energy theoretical physics paper. The process involved 110 separate drafts and 36 million tokens, compressed into roughly two weeks versus the typical timeline of about a year.

2.Stat

110 drafts, 36M tokens, 40+ hours of local CPU compute

These figures give a concrete sense of the iteration volume and computational cost involved in using Claude as a research collaborator on a single physics paper — intensive and iterative, not a simple prompt-and-publish pipeline.

3.Insight

Domain expertise was essential — Claude was fast but sloppy

Schwartz stresses that while Claude was fast and eager to please, it made enough errors that a domain expert was necessary to evaluate its outputs. He explicitly states AI is not doing end-to-end science yet, and the human's role shifted from doing the work to supervising and validating it.

4.Insight

Author calls this the most important paper he has written — for the method, not the physics

Schwartz frames the significance not in the scientific findings but in demonstrating a repeatable method for using LLMs to contribute to frontier science — something he says was not possible three months earlier. He concludes 'there is no going back.'

5.Context

Existing AI scientist projects criticized for selecting best result of many automated trials

Schwartz references Sakana AI's AI Scientist (August 2024), Google's AI co-scientist on Gemini (February 2025), and AI2's Asta ecosystem including CodeScientist and AutoDiscovery (August 2025), arguing their successes seem forced rather than genuinely autonomous.

6.Industry Update

AI has had more documented success in mathematics than in theoretical physics

DeepMind's FunSearch (2023) and AlphaEvolve made new discoveries in combinatorics. AlphaProof earned a silver medal at the 2024 International Mathematical Olympiad, and in 2025 an advanced version of Gemini achieved the gold-medal standard at the IMO.

7.Insight

Author suggests LLMs may need an intermediate phase before fully autonomous science

Schwartz argues the field should not skip intermediate steps on the path to end-to-end AI science, using the analogy that LLMs may need to go through a graduate school phase before reaching full scientific autonomy.

Research = scientific work and findings, Stat = quantitative data points, Insight = author's argued position or analysis, Context = background on related projects and systems, Industry Update = broader field developments

What This Means

A Harvard physics professor has demonstrated that a skilled domain expert working closely with Claude Opus 4.5 can compress roughly a year of theoretical physics research into two weeks — but only because the expert caught the model's frequent errors. This is not a story about AI replacing scientists; it is about AI dramatically accelerating the work of experts who know enough to supervise it. The author's view is that a method now exists for using LLMs to do frontier science, and that this represents a qualitative shift from what was possible just months ago.

Sources

Vibe physics: The AI grad studentAnthropic

Similar Events

Anthropic Releases Claude Opus 4.7: Hybrid Reasoning, 1M Context Window, and Cybersecurity Safeguards

Apr 16

Claude Code Architecture Dissected in Academic Paper

Apr 20