Featured Podcasts Big Technology Podcast: Anthropic vs. the Pentagon, Bloodbath at Block, The Citrini Selloff The Big Technology Podcast takes…
Browsing: tests
Featured Podcasts Lenny’s Podcast: A Child Psychologist’s Guide to Working with Difficult Adults | Dr. Becky Kennedy Interviews with world-class…
Bloomberg: Chinese startup Moonshot updates its Kimi model, claiming K2.5 can process text, images and video simultaneously and beats its…
Will Douglas Heaven / MIT Technology Review: OpenAI is testing training LLMs to produce “confessions,” or self-report how they performed…
Jonathan Kemper / The Decoder: Alibaba Technical Report: Qwen3-VL Beats GPT-5 and Gemini 2.5 Pro on Visual Tasks and Has…
Featured Podcasts The Talk Show with John Gruber: “Lincoln Bio Services,” with Stephen Robles The director’s commentary track for Daring…
Jared Perlo / NBC News: An Oxford Internet Institute study of 445 AI tests finds that many tests lack clear…
Featured Podcasts Hard Fork: Sora and the Infinite Slop Feeds + ChatGPT Goes to Therapy + Hot Mess Express The future…
Celia Ford / Transformer: Anthropic system card: Claude Sonnet 4.5 has been able to recognize many alignment evaluation environments such…
Kurt Wagner / Bloomberg: Mark Zuckerberg says that Instagram has reached 3B monthly users and tests a new navigation bar…