Ryan Whitwam / Ars Technica:
Google launches multi-token prediction writers for its Gemma 4 models, which use a form of speculative decoding to guess future tokens for faster inference— Google launched its Gemma 4 open models this spring, promising a new level of power and performance for local AI.
Trending
- Google launches multi-token prediction writers for its Gemma 4 models, which use a form of speculative decoding to guess future tokens for faster inference (Ryan Whitwam/Ars Technica)
- Former Yankee Mariano Rivera says he supports MLB salary cap
- Obama bluntly denounces Trump’s abuse of power
- SCOTUS rejects Apple’s request to temporarily block a court order in the Epic Games lawsuit that found Apple in violation of court-ordered App Store changes (Mike Scarcella/Reuters)
- OpenAI partners with researchers from Microsoft, AMD, Broadcom, Nvidia and Intel to detail the Multipath Reliable Connection (MRC) protocol to facilitate large-scale computing (Nat Rubio-Licht/The Deep View)
- CVS Health (CVS) First Quarter 2026 Results
- DefiLlama: Investors withdrew about $14 billion from the DeFi sector after North Korea-linked hackers stole $290 million from Aave in April, weeks after stealing $280 million from Drift (Financial Times)
- Study: U.S. schools that strictly ban cellphones in class haven’t had higher test scores on average so far, but students have reported a greater sense of well-being (Dana Goldstein/New York Times)

