Claude Fable 5 and Claude Mythos 5

At the moment we’re launching Claude Fable 5: a Mythos-class¹ mannequin that we’ve made protected for basic use.

Fable 5’s capabilities exceed these of any mannequin we’ve ever made typically accessible. It’s state-of-the-art on practically all examined benchmarks of AI functionality, exhibiting distinctive efficiency in software program engineering, information work, imaginative and prescient, scientific analysis, and plenty of different areas. The longer and extra complicated the duty, the bigger Fable 5’s lead over our different fashions.

Releasing a mannequin this succesful comes with dangers. With out safeguards, Fable 5’s capabilities in areas like cybersecurity could possibly be misused to trigger critical harm. We’ve due to this fact launched the mannequin with safeguards that imply queries on some subjects will as an alternative obtain a response from our next-most-capable mannequin, Claude Opus 4.8. To launch the mannequin each safely and shortly, we’ve tuned these safeguards conservatively—they’ll generally catch innocent requests, although they set off, on common, in lower than 5% of classes. With extra succesful fashions arriving within the coming months, we’re working to enhance our safeguards and scale back false positives as shortly as we will.

For a small group of cyberdefenders and infrastructure suppliers, we’re additionally launching Claude Mythos 5. It’s the identical underlying mannequin as Fable 5, however with the safeguards lifted in some areas.² Mythos 5 will initially be deployed by Project Glasswing, in collaboration with the US authorities, as an improve to Claude Mythos Preview. It has the strongest cybersecurity capabilities of any mannequin on the earth. Quickly, we intend to increase entry to Mythos 5 by a broader trusted entry program.

The capabilities of fashions like Fable 5 and Mythos 5 have the potential to do profound good for the world. We’ve seen the beginnings of this in Venture Glasswing, the place the fashions have helped cyber defenders safe critically necessary software program. We’ve additionally seen it in life sciences analysis, the place the fashions are positing novel hypotheses and dashing up the event of recent therapeutics.

Fable 5 and Mythos 5 are being supplied at $10 per million enter tokens and $50 per million output tokens—lower than half the worth of Claude Mythos Preview. At the moment’s joint launch is one other step in the direction of our purpose of bringing superior AI capabilities to as many customers as potential, as shortly and as safely as we will.

Evaluating Claude Fable 5 and Claude Mythos 5

The desk beneath compares the capabilities of Fable 5 and Mythos 5 to different main fashions.

Benchmark table showing Claude Fable and Mythos compared to other leading models

Fable 5 and Mythos 5 can work autonomously for longer than any earlier Claude fashions. Beneath we focus on how these expertise apply to software program engineering, and canopy the mannequin’s improved capabilities in information work, imaginative and prescient, reminiscence, and life sciences analysis.

Software program engineering. Throughout early testing, Stripe reported that Fable 5 compressed months of engineering into days. In a 50-million-line Ruby codebase, the mannequin carried out a codebase-wide migration in a day that will in any other case have taken an entire workforce over two months by hand. Fable 5 can be extra token-efficient than previous Claude fashions: on Cognition’s FrontierCode analysis, which exams whether or not fashions can move troublesome coding duties whereas assembly the requirements of high-quality manufacturing codebases, Fable 5 scores highest amongst frontier fashions, even at medium effort.

Data work. Fable 5 reveals robust efficiency on complicated analytical duties. On Hebbia’s Finance Benchmark for senior-level reasoning, Fable 5 has the very best rating of any mannequin, with substantial beneficial properties in document-based reasoning, chart and desk interpretation, and drawback fixing. IMC famous that Fable 5 aced their trading-analysis evaluations practically throughout the board, together with factual lookup, conceptual reasoning, root-cause evaluation, and expected-value evaluation.

Imaginative and prescient. Fable 5 is the brand new state-of-the-art mannequin for duties involving imaginative and prescient. It might probably extract exact numbers from detailed scientific figures and might carry out complicated vision-based duties like rebuilding an internet app’s supply code from screenshots alone. It additionally wants much less scaffolding: for instance, earlier Claude fashions struggled to play Pokémon Hearth even with harnesses that gave them further useful instruments, however Fable 5 beat FireRed with a minimal, vision-only harness.