Anthropic's Claude Opus 4.7 beats humans in robotics planning tasks
Anthropic's Claude Opus 4.7 completed robotics planning tasks 20 times faster than human teams in Phase Two of Project Fetch. The model excelled at decision-making and code efficiency but failed at precise motion control tasks. Improvements were attributed to general model scaling, not specific robotics research.

*this image is generated using AI for illustrative purposes only.
Anthropic has released findings from its "Project Fetch" Phase Two, indicating that its Claude Opus 4.7 model can outperform human teams in robotics planning tasks. The company reported that the model completed tested objectives at speeds roughly 20 times faster than the quickest human teams from the previous year. While the model demonstrated significant speed advantages, it faced limitations in precision control, highlighting a gap between planning and physical execution.
Performance Comparison
The trials involved three tests using Claude Opus 4.7 within Claude Code. A researcher connected a laptop to the robot, entered the initial prompt, and approved commands and task transitions. The results were compared against an original experiment from August 2025, which pitted Anthropic employees using Claude support against a group limited to web research and their own problem-solving.
| Metric | Claude Opus 4.7 | Human Teams |
|---|---|---|
| Speed on tested objectives | ~20x faster | Baseline |
| Speed on completed steps | At least 10x faster | Baseline |
Capabilities and Limitations
Anthropic noted that Opus 4.7 moved quickly through choices that typically slow humans down, such as interfacing with the robot's sensors. The model produced less code than the Claude-assisted human team while matching or exceeding outcomes on the tested tasks. However, the company emphasized that these results do not constitute a robotics breakthrough. The model struggled with the "fetching" portion of the test, which required precisely guiding a beach ball back to a start area using environmental feedback. While Opus 4.7 could position the robot to attempt the push, the motion control was not accurate enough to complete the task.
Future Research
The improvements were driven by broader model scaling rather than targeted robotics research. Anthropic stated that more research is needed to understand the models' ability to make physical tools more bespoke, whether by writing control policies or designing robotic systems. The company suggested that while barriers exist to generalized physically capable language models, rapid advancements in software tools suggest a similar trajectory in hardware is possible.
How will Anthropic address the precision control gap to bridge the disconnect between high-level planning and physical execution?
Could the speed advantages demonstrated in software planning translate to commercial robotics applications requiring real-time decision-making?
What specific hardware advancements are necessary to support the trajectory of generalized physically capable language models?




























