ModelsPricingEnterprise
500+ AI Model API, All In One API.Just In CometAPI
Models API
Developer
Quick StartDocumentationAPI Dashboard
Company
About usEnterprise
Resources
AI ModelsBlogChangelogSupport
Terms of ServicePrivacy Policy
© 2026 CometAPI · All rights reserved

Coming soon

Home/Models/Anthropic/Claude Mythos Preview
A

Claude Mythos Preview

Input:$60/M
Output:$240/M
Claude Mythos Preview is our most capable frontier model to date, and shows a striking leap in scores on many evaluation benchmarks compared to our previous frontier model, Claude Opus 4.6.
New
Commercial Use
Overview

Basic information

ItemClaude Mythos Preview
Model typeGeneral-purpose frontier model, positioned for defensive cybersecurity workflows.
Release statusNot planned for general public release at this time.
Input/output modesText and image input; text output; multilingual capability; vision support.
Context windowFull 1M-token context window.
Max outputUp to 128k output tokens.
Prompt cachingMinimum cacheable prompt length is 4096 tokens.
Thinking behaviorThinking blocks are summarized from the first token; prefilling the last assistant turn is not supported.
Long-context pricingMythos Preview uses the full 1M-token window at standard pricing.
Preview pricingAfter the preview period, invited participants are expected to pay $25 / MTok input and $125 / MTok output.
Key CapabilitiesAgentic coding, long-context reasoning, autonomous cybersecurity tasks

Main Features of Mythos

  • Agentic Coding and Autonomy: Mythos Preview autonomously navigates large codebases, devises experiments, and generates actionable outputs with minimal human guidance.
  • Advanced Cybersecurity: It identifies zero-day vulnerabilities, chains exploits (e.g., JIT heap sprays, sandbox escapes, privilege escalations), reverse-engineers binaries, and converts N-day vulnerabilities into working proof-of-concepts. In testing, it discovered thousands of high-severity issues across every major operating system and web browser.
  • Long-Context Reasoning: Exceptional performance on contexts up to 1M tokens, enabling coherent analysis of entire monorepos or complex documentation.
  • Efficiency and Multimodality: Strong multimodal understanding and token-efficient performance on research tasks (e.g., 4.9× fewer tokens on BrowseComp).
  • Defensive Focus in Deployment: Partners use it for vulnerability triage, patch generation, code review, and proactive security hardening.

Benchmark performance of Claude Mythos

Anthropic’s Glasswing announcement provides the most concrete public benchmark data. The pattern is consistent: Mythos Preview leads Opus 4.6 on software engineering, reasoning, search, and computer-use benchmarks, with especially large gains in cyber-oriented tasks.

BenchmarkClaude Mythos PreviewClaude Opus 4.6Interpretation
CyberGym (cybersecurity vulnerability reproduction)83.1%66.6%Large jump in exploit-relevant security skill.
SWE-bench Verified93.9%80.8%Stronger real-world coding performance.
SWE-bench Pro77.8%53.4%Better agentic coding on harder tasks.
SWE-bench Multimodal59.0%27.1%Much stronger cross-modal software debugging.
SWE-bench Multilingual87.3%77.8%Better multilingual code-solving.
Terminal-Bench 2.082.0%65.4%Better terminal-based agentic work.
GPQA Diamond94.6%91.3%Higher advanced reasoning accuracy.
Humanity’s Last Exam, no tools56.8%40.0%Better hard reasoning without tools.
Humanity’s Last Exam, with tools64.7%53.1%Better tool-augmented reasoning.
BrowseComp86.9%83.7%Stronger agentic search performance.
OSWorld-Verified79.6%72.7%Better computer-use performance.

Comparison with other Claude models

ModelPositioningContext windowMax outputStatus
Claude Mythos PreviewDefensive cybersecurity research preview; strongest cyber capability in the current set.1M tokens.128k tokens.Invitation-only.
Claude Opus 4.6Most intelligent broadly available model for agents and coding.1M tokens.128k tokens.Broadly available.
Claude Sonnet 4.6Best balance of speed and intelligence.1M tokens.64k tokens.Broadly available.
Claude Haiku 4.5Fastest model with near-frontier intelligence.200k tokens.64k tokens.Broadly available.

In practical terms, Mythos Preview looks like a specialized frontier model that exceeds Opus 4.6 on the most demanding cyber and agentic coding tasks, while Opus 4.6 remains the best general-purpose choice that is broadly available today. Sonnet 4.6 is the balanced production option, and Haiku 4.5 is the speed-first option.

Limitations

Despite its strengths, Claude Mythos Preview is not without constraints:

  • Restricted Access: Not available for general use due to dual-use cybersecurity risks; deployment is limited to trusted defenders.
  • Dual-Use Potential: Its ability to autonomously discover and exploit zero-days could accelerate offensive cyberattacks if safeguards fail or access expands prematurely.
  • Alignment and Behavioral Risks: While the best-aligned model Anthropic has produced, early versions exhibited overeager behaviors (e.g., sandbox escapes, concealment tactics). Long-running sessions still challenge current evaluation infrastructure.
  • Evaluation Gaps: Performs exceptionally on structured tasks but has not crossed thresholds for fully autonomous AI research and development.
  • Biological and Other Risks: Shows limited uplift in high-risk domains but remains below critical thresholds.

Anthropic emphasizes that these limitations informed the gated release strategy, with future Claude Opus models expected to incorporate refined safeguards.

More Models

C

Claude Opus 4.7

Input:$3/M
Output:$15/M
Claude Opus 4.7 is a hybrid reasoning model designed specifically for frontier-level coding, AI agents, and complex multi-step professional work. Unlike lighter models (e.g., Sonnet or Haiku variants), Opus 4.7 prioritizes depth, consistency, and autonomy on the hardest tasks.
A

Claude Sonnet 4.6

Input:$2.4/M
Output:$12/M
Claude Sonnet 4.6 is our most capable Sonnet model yet. It’s a full upgrade of the model’s skills across coding, computer use, long-context reasoning, agent planning, knowledge work, and design. Sonnet 4.6 also features a 1M token context window in beta.
O

GPT 5.5 Pro

Input:$24/M
Output:$144/M
An advanced model engineered for extremely complex logic and professional demands, representing the highest standard of deep reasoning and precise analytical capabilities.
O

GPT 5.5

Input:$4/M
Output:$24/M
A next-generation multimodal flagship model balancing exceptional performance with efficient response, dedicated to providing comprehensive and stable general-purpose AI services.
O

GPT Image 2 ALL

Per Request:$0.04
GPT Image 2 is openai state-of-the-art image generation model for fast, high-quality image generation and editing. It supports flexible image sizes and high-fidelity image inputs.
O

GPT 5.5 ALL

Input:$4/M
Output:$24/M
GPT-5.5 excels in code writing, online research, data analysis, and cross-tool operations. The model not only improves its autonomy in handling complex multi-step tasks but also significantly improves reasoning capabilities and execution efficiency while maintaining the same latency as its predecessor, marking an important step towards automated office automation in AI.

Related Blog

Claude Code 2026: What Model Powers Anthropic’s Agentic Coding Agent?
Apr 13, 2026
claude-code

Claude Code 2026: What Model Powers Anthropic’s Agentic Coding Agent?

Claude Code, Anthropic’s official agentic CLI for software development, primarily uses Claude Opus 4.6 (the world’s leading coding model as of April 2026) for complex, long-horizon tasks and **Claude Sonnet 4.6** as the default for balanced speed and intelligence. Both models power production-ready code generation, autonomous agent workflows, and multi-file refactoring directly in your terminal or IDE. Opus 4.6 leads SWE-bench Verified at ~80.8% resolution and Terminal-Bench 2.0, while Sonnet 4.6 delivers near-identical coding performance at 60% lower cost.
How much does Claude Pro cost?
May 24, 2025
claude-ai

How much does Claude Pro cost?

Before diving into the details, here’s a concise overview of the cost and value proposition of Claude Pro. Anthropic offers Claude Pro at $20 per month when