A Chinese AI model will demonstrably outperform Anthropic's Claude Opus 4.6 or a newer Claude model on a widely recognized AI benchmark before June 21, 2026.
0%
A Chinese AI model will demonstrably outperform Anthropic's Claude Opus 4.6 or a newer Claude model on a widely recognized AI benchmark before June 21, 2026.
Resolution date
Tags
Recent reports indicate that Chinese AI models are rapidly advancing, with one agent system surging to second place globally on Terminal-Bench 2.0. The Kimi AI model is also reported to outperform Claude in coding tasks and other categories, often at a lower cost. Meanwhile, Anthropic has recently updated its Claude Opus model to version 4.6, enhancing its capabilities in complex tasks like financial analysis.
Resolved YES if a reputable AI research institution (e.g., Stanford HAI, Hugging Face, or a major university lab) or a widely accepted benchmark platform (e.g., Terminal-Bench, MT-Bench, HELM) publishes results showing a Chinese AI model demonstrably outperforming Anthropic's Claude Opus 4.6 or any newer Claude model released before the resolution date, in at least one significant capability (e.g., coding, reasoning, complex problem-solving). The performance must be statistically significant and publicly verifiable. Resolved NO otherwise.