Cheng Ting-Fang / Nikkei Asia: AMD CEO Lisa Su predicts that the processor market will grow more than 35% annually…
Browsing: inference
Ben Thompson / Strategy: Agentic inference will be different from current inference and will change the computational infrastructure because speed…
Ryan Whitwam / Ars Technica: Google launches multi-token prediction writers for its Gemma 4 models, which use a form of…
Dina bass / Bloomberg: Cloud computing provider Nebius agrees to buy Eigen AI, which optimizes the performance of chips running…
Featured Podcasts Channels with Peter Kafka: The World Cup is coming to Trump’s America, with Roger Bennett Media and technology…
Wall Street Journal: Sources: Nvidia plans to unveil new AI inference chip at its GTC conference in March; the system…
Max A. Cherney / Reuters: Toronto startup Taalas, which embeds AI models in custom silicon to achieve faster inference, has…
Eric Jang / Evjang.com: A look at the state of AI agents, evolving thought patterns, the huge need for inference…
Gavin Boulanger / @gavinsbaker: As inference splits into pre-filling and decoding, Nvidia’s Groq deal could enable a “Rubin SRAM” variant…
Christophe Mims / Wall Street Journal: OpenAI’s Broadcom deal shows how the AI startup is diversifying its chip suppliers, including…