    Addison Markets

    Google’s latest DiffusionGemma open AI model comes with a 4x speed boost

    By franperez66q@protonmail.com, June 11, 2026


    Another day, another AI model from Google. This time, Google DeepMind has released a new member of the Gemma 4 open model family, but it’s fundamentally different from the rest of the lineup. DiffusionGemma doesn’t generate outputs linearly like most AI models. Instead, it can produce an entire block of text in parallel. Google says this makes it faster and more efficient when running on local hardware like an Nvidia DGX or a humble gaming GPU.

    Most AI models are autoregressive: they generate text left to right, one token at a time. DiffusionGemma has more in common with image generation models, which start with static and then denoise it to create the desired content. The model starts with a field of placeholder tokens and runs over that canvas multiple times, generating likely tokens and using them to improve its estimates of the others. At the end of the process, it finalizes its token outputs in one large block, the “denoised” text canvas.
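To make the denoising loop described above concrete, here is a minimal toy sketch in Python. It is not Google's implementation: `toy_model`, `denoise`, the confidence formula, and the `steps` parameter are all hypothetical, and the "model" cheats by reading the answer from a known target. The only part meant to mirror the description is the control flow: start from a canvas of placeholders, score every open slot on each pass, commit the most confident guesses, and let committed tokens raise confidence in their neighbors.

```python
from typing import Dict, List, Tuple

MASK = "_"  # placeholder token that fills the canvas before denoising


def toy_model(canvas: List[str], target: List[str]) -> Dict[int, Tuple[str, float]]:
    """Hypothetical stand-in for the denoising network.

    Returns {position: (guess, confidence)} for every masked slot.
    Confidence rises with the number of already-filled neighbours,
    imitating how settled tokens help estimate the remaining ones.
    The guess is read from `target`, which a real model cannot do.
    """
    preds: Dict[int, Tuple[str, float]] = {}
    n = len(canvas)
    for i, tok in enumerate(canvas):
        if tok != MASK:
            continue
        filled = sum(
            1 for j in (i - 1, i + 1) if 0 <= j < n and canvas[j] != MASK
        )
        # The small position term only breaks ties deterministically.
        confidence = 0.3 + 0.3 * filled + 0.001 * (n - i)
        preds[i] = (target[i], confidence)
    return preds


def denoise(target: List[str], steps: int) -> Tuple[List[str], List[List[str]]]:
    """Fill a fully masked canvas in at most `steps` parallel passes."""
    canvas = [MASK] * len(target)
    history = [canvas.copy()]
    for step in range(steps):
        preds = toy_model(canvas, target)  # one pass scores all open slots
        if not preds:
            break
        # Commit enough slots per pass to finish by the final step (ceiling division).
        k = -(-len(preds) // (steps - step))
        best = sorted(preds, key=lambda i: preds[i][1], reverse=True)[:k]
        for i in best:
            canvas[i] = preds[i][0]
        history.append(canvas.copy())
    return canvas, history


if __name__ == "__main__":
    words = "the model fills the whole canvas in parallel".split()
    final, history = denoise(words, steps=3)
    for snapshot in history:
        print(" ".join(snapshot))
```

Unlike this sketch, which never revisits a committed slot, the article describes DiffusionGemma as able to keep self-correcting tokens across passes.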

    DiffusionGemma is fairly large in the realm of Google’s open models. It’s a Mixture of Experts (MoE) model with a total of 26 billion parameters, but only 3.8 billion are activated during inference. That means it should fit in the 18GB RAM allotment of a high-end GPU. In testing with an RTX 5090, DiffusionGemma spits out around 700 tokens per second. With a single Nvidia H100 AI accelerator, DiffusionGemma can produce 1,000+ tokens per second. That’s about four times the output of the similarly sized autoregressive Gemma models.
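The figures in this paragraph can be checked with some back-of-the-envelope arithmetic. The sketch below counts weights only, ignoring activations and any caches, and treats 1GB as 10^9 bytes. The bit widths are assumptions, since the article does not say what precision or quantization Google used, and the helper names are hypothetical.

```python
TOTAL_PARAMS = 26e9    # total parameters, from the article
ACTIVE_PARAMS = 3.8e9  # parameters activated during inference, from the article


def weight_gb(params: float, bits_per_param: float) -> float:
    """Approximate weight storage in GB (10^9 bytes) at a given precision."""
    return params * bits_per_param / 8 / 1e9


def active_fraction(active: float, total: float) -> float:
    """Share of the parameters used for any single token in an MoE model."""
    return active / total


if __name__ == "__main__":
    share = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
    print(f"active share: {share:.1%}")
    # Assumed bit widths; the article does not state which one applies.
    for bits in (16, 8, 4):
        print(f"{bits}-bit weights: {weight_gb(TOTAL_PARAMS, bits):.1f} GB")
    # If "four times" refers to the H100 figure, the implied baseline is:
    print(f"implied autoregressive rate: {1000 / 4:.0f} tokens/s")
```

Under these assumptions, all 26 billion weights still have to be stored even though only 3.8 billion are used per token, so an 18GB budget would imply roughly 4-bit to 5-bit weights.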



    By generating up to 256 tokens in parallel, this approach to text generation shifts the bottleneck from memory bandwidth to compute. Google says this offers a measurable boost in non-linear tasks like in-line editing, molecular sequencing, and mathematical graphing. An accompanying animation shows how DiffusionGemma was tuned to solve Sudoku puzzles, a notoriously challenging task for standard autoregressive AI models because each token depends on future tokens. DiffusionGemma’s ability to continuously self-correct large sets of tokens makes that easier.
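One way to see why parallel blocks move the bottleneck is to count forward passes. In the rough sketch below, an autoregressive decoder needs one pass per token, and each pass has to read the active weights from memory again. A block-diffusion decoder needs a fixed number of denoising passes per block, each doing far more arithmetic per weight read. The 256-token block size comes from the article; `steps_per_block` is a hypothetical value, since the article gives no step count.

```python
import math


def autoregressive_passes(num_tokens: int) -> int:
    """One forward pass per generated token."""
    return num_tokens


def block_diffusion_passes(num_tokens: int, block_size: int, steps_per_block: int) -> int:
    """Denoising passes needed when tokens are produced block by block."""
    blocks = math.ceil(num_tokens / block_size)
    return blocks * steps_per_block


if __name__ == "__main__":
    n = 1024
    ar = autoregressive_passes(n)
    # steps_per_block=32 is an assumed value, not a published one.
    bd = block_diffusion_passes(n, block_size=256, steps_per_block=32)
    print(f"autoregressive: {ar} passes, block diffusion: {bd} passes")
```

Fewer passes does not mean proportionally less work: each diffusion pass processes a whole block, so total compute can go up even as the number of weight reads goes down.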


