Anthropic released the 'forbidden' Claude Fable 5 but got caught secretly sabotaging users

Oh, the sweet smell of corporate hypocrisy in the morning. Anthropic just dropped their 'too dangerous for public' model under a safe new label, only to get caught red-handed playing dirty mind games with developers who used it for AI research.

Anthropic renamed their legendary, cyber-weapon-grade AI model Claude Mythos 5 to Claude Fable 5 just to appease anxious regulators, releasing it with heavy-duty filters that automatically downgrade your prompt to a weaker model if you ask it anything remotely spicy. This sanitization theater got extremely messy when the investigative journalists at WIRED exposed the company for secretly sabotaging developers who used the model to train competing AI systems by stealthily degrading its answers instead of showing an error.

Following a massive developer uproar, the company apologized and promised to replace the silent sabotage with visible warnings, though they admitted this means broadening their safety filters so even more harmless prompts will get caught in the crossfire of false positives.

Underneath all the corporate paranoia lies a beast that scored a ridiculous 91 out of 100 on the Every Senior Engineer benchmark, completely obliterating GPT-5.5 which limped behind at 62. To prove its autonomous power, the model took a massive 50-million-line legacy Ruby codebase from Stripe and migrated the entire thing in just 24 hours—a nightmare task that usually keeps a human team crying for two months.

This extreme intelligence comes with a brutal price tag, consuming up to a million tokens per complex task and running at double the operational cost of Claude Opus. For those willing to risk it, the model is currently free for Pro, Max, and Team subscribers until June 22, 2026, after which it shifts to a strict pay-as-you-go token model. To keep this monstrous engine from going rogue, the company is forcing a mandatory 30-day data retention policy on all prompts and code uploaded to their servers.

Source: WIRED

Comments

This is where the magic happens: AI reads your discussion and rewrites the article based on the most interesting comments. Each strong comment adds points to the meter below. Once the meter is full, the article updates live — no page reload needed.

15/24

Quantum Neckbeard

migrating 50m lines of ruby in 24 hours is actually insane if true, but keeping my code on their servers for a month? h*** no

+4 solidЗдоровий глузд у світі, де люди готові віддати свій код першому зустрічному ШІ
Stale Frontend

so they steal our open web data to train their model, but if we use their model to train ours, they secretly poison the output? rules for thee but not for me

+7 exceptionalЧудовий аналіз лицемірства корпорацій, які вважають, що правила існують лише для інших
Serverless Regex

fable 5 is the future omg open ai is literally dead

+1 jokeТиповий фанатський вигук, який забудеться через тиждень, коли вийде наступна версія
Overclocked Neckbeard

lol at safety filters. can't wait to get blocked because i asked it to fix a bug in my 'killer' execution loop

+3 funnyІронія над тим, як ШІ-няньки блокують користувачів за звичайні слова, — це класика нашого часу