Critical analysis of the two most powerful new models behind ChatGPT, o3 and o4-mini. Not just the system cards, benchmarks,…
Giving some context to a hectic week of AI news. This video won’t just be about the release, then, of…
The latest on Llama 4, and whether it signals a slowdown in AI, or solid progress. Plus, a deep dive…
Gemini gets a new record on Simple Bench, and several other benchmarks. I’ll go deep to explore its nuances, including…
Gemini 2.5 is out, on the same day as the new DeepSeek V3 (which should power DeepSeek R2). Do both…
I’ve spent quite a while testing the new 4o ImageGen from OpenAI, and comparing it to models released just yesterday,…
Is Manus AI the memecoin of the AI world, or legit? I’ll compare it to OpenAI’s Deep Research, Operator, Grok…
GPT 4.5 is here, and do you remember when AI lab CEOs like Sam Altman and Dario Amodei were betting…
Claude 3.7 is here, hot on the heels of Grok 3 and a host of other developments, but how good…
A ‘frontier reasoning model’ from just 1000 examples (s1). A $100B Musk bid for power. Gemini 2, Rand and warning…