DiffusionGemma is here. But what is Diffusion?
If you've used image or video generation models, you've already seen diffusion in action, even if you never knew the name for it.
Diffusion is the art of producing a response as a whole, refining many tokens in parallel over a handful of passes, rather than writing it out strictly one token after another.
When you talk to, say, GPT 5.5, the reply is generated word for word, token by token, and each new token has to wait for all the ones before it. That keeps the response coherent, but it's quite an expensive way to generate text.
It's like talking to a PhD-level expert, asking them how planes stay in the air, then immediately asking how to make a cheese sandwich.
Your expert will answer both questions properly, but it will cost the expert brain cells and capacity.
In plain English, this means diffusion models don't need to produce each word/token strictly in sequence, which can save A LOT of time and compute.
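Here's a toy sketch of that difference, just to make the idea concrete. It is NOT how DiffusionGemma (or any real model) works internally: there is no model here at all, and the function names, the `MASK` placeholder, and the `tokens_per_pass` value are made up for illustration. It "cheats" by copying from a known answer and only counts how many passes each style would need.

```python
MASK = "_"  # hypothetical placeholder for a not-yet-generated token


def autoregressive_passes(target):
    """Sequential style: one pass per generated token."""
    output = []
    passes = 0
    for token in target:
        output.append(token)  # stand-in for one model call predicting the next token
        passes += 1
    return output, passes


def diffusion_style_passes(target, tokens_per_pass=4):
    """Parallel style: start fully masked, fill in several positions per pass."""
    output = [MASK] * len(target)
    passes = 0
    while MASK in output:
        masked = [i for i, tok in enumerate(output) if tok == MASK]
        # A real model would choose which positions to commit (e.g. by confidence);
        # taking the leftmost ones keeps this toy deterministic.
        for i in masked[:tokens_per_pass]:
            output[i] = target[i]
        passes += 1
    return output, passes


target = "planes stay up because wings generate lift from airflow".split()

_, ar = autoregressive_passes(target)
_, diff = diffusion_style_passes(target)
print(f"token by token: {ar} passes")
print(f"several tokens per pass: {diff} passes")
```

For this 9-word answer the sequential loop needs 9 passes and the parallel one needs 3. Real diffusion models do more work per pass, so the actual savings depend on the model and the hardware.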
What's the real upside?
Over 1,000 tokens per second. On consumer hardware like 's.
Mind-boggling speeds, right?
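To put that number in perspective, here's a quick back-of-the-envelope calculation. The 1,000 tokens per second figure is the one quoted above. The 2,000-token response size and the 50 tokens per second comparison speed are assumptions picked purely for illustration, not measurements of any particular model.

```python
def generation_seconds(num_tokens, tokens_per_second):
    """Time to generate a response, ignoring prompt processing and network latency."""
    return num_tokens / tokens_per_second


RESPONSE_TOKENS = 2_000   # assumed response size (illustrative)
DIFFUSION_SPEED = 1_000   # tokens per second, the figure quoted in this post
COMPARISON_SPEED = 50     # tokens per second, assumed slower baseline (illustrative)

fast = generation_seconds(RESPONSE_TOKENS, DIFFUSION_SPEED)
slow = generation_seconds(RESPONSE_TOKENS, COMPARISON_SPEED)

print(f"At {DIFFUSION_SPEED} tok/s: {fast:.0f} seconds")   # 2 seconds
print(f"At {COMPARISON_SPEED} tok/s: {slow:.0f} seconds")  # 40 seconds
```

Under those assumptions, the same response goes from a 40-second wait to a 2-second one.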
And we're just getting started. This is the 3rd generation of Diffusion LLMs on the market and the first from Google.
Can't wait to see other providers building in the diffusion space soon, because damn, guys, we're going into hyperspace mode!!
What could you do with speeds of 1,000 tokens per second?
How many websites, apps, software tools, or client solutions could you build IN A DAY now?
Loads, I say.
Hamish Aman Prakash