Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    CrowdStrike warns of increasing Chinese AI cyberattacks on U.S. tech

    June 10, 2026

    Lenovo slides nearly 10% amid reports of price hikes due to memory costs

    June 10, 2026

    Three key vital signs make up the “urban pulse” of a city

    June 10, 2026
    Facebook X (Twitter) Instagram
    Addison Markets
    • Home
    • USA
    • Europe
    • Business
    • Investing
    • Tech
    • Politics
    • Contact Us
    Addison Markets
    Home»Tech»These LLMs are the best at resisting Russian propaganda
    Tech

    These LLMs are the best at resisting Russian propaganda

    franperez66q@protonmail.comBy franperez66q@protonmail.comJune 5, 2026No Comments2 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email


    Open-weight models, including Nvidia’s Nemotron and Alibaba’s Qwen, showed strong results comparable to Anthropic’s best models. GPT-5.4—the best-performing model from OpenAI—also performed relatively well on the benchmark, providing “Exemplary” responses on 54 percent of questions and achieving an 88.9 mean score.

    Unsurprisingly, recent frontier models showed a much stronger tendency to resist Russian propaganda than models from just a few years ago. Claude 3.5 Haiku—the highest-rated model released in 2024—received a mean rating of just 73.1 on the benchmark. That mark would put it in the bottom third of models released in 2026 on this metric.



    Detailed benchmarks for Google’s Gemini 2.5 Pro model show particularly sensitivity to malicious prompts and prompts in Russian.

    Detailed benchmarks for Google’s Gemini 2.5 Pro model show particularly sensitivity to malicious prompts and prompts in Russian.


    Credit:

    Estonian Language Institute


    But that improvement over time was not uniform across all LLM makers. Google’s most propaganda-resistant LLM, Gemini 2.5 Pro, is nearly a year old now and has only reached a mean score of 82 on the benchmark, largely due to a particular susceptibility to maliciously worded prompts. The most recent tested Google model, Gemini 3.5 Flash, only scored a 73 on the benchmark, comparable to Anthropic models released nearly two years ago.

    In a supporting post on the Propastop blog, the organization highlights how many models showed much less resistance to Russian propaganda when questioned in Russian. Google’s Gemini 3.5 Flash received significantly lower benchmark scores in Russian than in English, as did open-weight models like Moonshot’s Kimi K2 and StepFun’s Step 3.5 Flash.

    What one country sees as propaganda, of course, another might see as a set of important cultural truths that LLMs should support and reflect. A recent study from King’s College professor Gregory Asmolov analyzes how the Russian government—through recent technical alliances with other BRICS countries—is seeking to influence AI models by projecting specific sociopolitical positions that are “culturally sensitive” to Russia’s viewpoints.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    franperez66q@protonmail.com
    • Website

    Related Posts

    Three key vital signs make up the “urban pulse” of a city

    June 10, 2026

    Stocks recover from sell-off, but even bulls warn of more volatility

    June 10, 2026

    Locked in heated rivalry with researcher, Microsoft fixes 0-day they disclosed

    June 10, 2026

    How Jim Cramer views Apple on Day 2 of its post-WWDC keynote pullback

    June 10, 2026

    Starlink charges $10 monthly hardware fee in move away from one-time purchases

    June 10, 2026

    Entergy CEO pushes back on fears that AI data centers will drive up electricity bills

    June 9, 2026
    Leave A Reply Cancel Reply

    Top Reviews
    Editors Picks

    CrowdStrike warns of increasing Chinese AI cyberattacks on U.S. tech

    June 10, 2026

    Lenovo slides nearly 10% amid reports of price hikes due to memory costs

    June 10, 2026

    Three key vital signs make up the “urban pulse” of a city

    June 10, 2026

    These dividend stocks have slashed their payments, Morgan Stanley says

    June 10, 2026
    © 2026 All right reserved
    • Privacy Policy
    • Terms & Conditions

    Type above and press Enter to search. Press Esc to cancel.