Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Republican Steve Hilton advances to general election in race for California governor

    June 10, 2026

    Peru election remains on knife edge between Fujimori and Sanchez

    June 10, 2026

    Entergy CEO pushes back on fears that AI data centers will drive up electricity bills

    June 9, 2026
    Facebook X (Twitter) Instagram
    Addison Markets
    • Home
    • USA
    • Europe
    • Business
    • Investing
    • Tech
    • Politics
    • Contact Us
    Addison Markets
    Home»Tech»Anthropic says these topics are too dangerous to let its Fable 5 model talk about
    Tech

    Anthropic says these topics are too dangerous to let its Fable 5 model talk about

    franperez66q@protonmail.comBy franperez66q@protonmail.comJune 9, 2026No Comments2 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email


    Anthropic Tuesday publicly released Claude Fable 5, its first “Mythos-class” model that it says surpasses its previous frontier Opus models in overall capabilities. But the model’s launch today comes with safeguards designed to prevent it from answering queries on topics like cybersecurity, biology, and chemistry, where the company has publicly worried about its potential impact to “uplift” malicious actors.

    Anthropic says Fable 5 operates on the “same underlying model” as Mythos 5, which is coming out of its monthslong “Mythos Preview” period today, but only for “a small group of cyberdefenders” judged trustworthy through the existing Project Glasswing. Unlike Mythos 5, though, the publicly accessible Fable 5 is designed to funnel queries on certain sensitive topics to the earlier Claude Opus 4.8 model and to warn the user when this is happening.



    Among the many claimed benchmark improvements for Fable 5, the one related to cybersecurity was a particularly large jump.

    Among the many claimed benchmark improvements for Fable 5, the one related to cybersecurity was a particularly large jump.


    Credit:

    Anthropic


    Anthropic said it has tuned these safeguards to be “stricter than ideal,” meaning the system may occasionally refuse “harmless requests” in a way that it acknowledges may be frustrating for regular users. But Anthropic says such false positives come up in less than five percent of all sessions in testing, and were worth it to avoid situations where Mythos could give malicious actors assistance in “causing serious harm that they couldn’t have received from other sources.”

    I can’t let you do that, Dave

    Fable 5’s topic-based safeguards are built around a system of classifiers designed to broadly detect banned prompt subjects as well as any potential jailbreak attempts. In over 1,000 hours of red-team testing with a bug bounty program, Anthropic says external teams failed to find any universal jailbreaks for Fable 5. The new model also resisted automated jailbreak attempts to a much larger degree than previous Claude Opus models, Anthropic said.

    The company said it is particularly worried about Mythos 5’s ability to perform “agentic hacking,” executing multi-part cyberattacks with much more facility than earlier models. But testing from the UK’s AI Security Institute in recent months found that Mythos Preview performed similarly to OpenAI’s GPT-5.5 on a suite of Capture the Flag challenges, suggesting Mythos’ performance is not “a breakthrough specific to one model.”



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    franperez66q@protonmail.com
    • Website

    Related Posts

    Entergy CEO pushes back on fears that AI data centers will drive up electricity bills

    June 9, 2026

    GM eyes new battery type to grow data center, energy storage business

    June 9, 2026

    Paramount accuses Netflix of “scorched-earth campaign” against WBD merger

    June 9, 2026

    Anthropic releases Mythos-like AI model to the public, Claude Fable 5

    June 9, 2026

    One day after discovery, Meta pulls facial recognition code from its smart glasses

    June 9, 2026

    SpaceX IPO explained: Price is set, but retail still up in the air

    June 9, 2026
    Leave A Reply Cancel Reply

    Top Reviews
    Editors Picks

    Republican Steve Hilton advances to general election in race for California governor

    June 10, 2026

    Peru election remains on knife edge between Fujimori and Sanchez

    June 10, 2026

    Entergy CEO pushes back on fears that AI data centers will drive up electricity bills

    June 9, 2026

    Jeffrey Epstein assistant Lesley Groff questioned by House panel

    June 9, 2026
    © 2026 All right reserved
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.