GPT 5.2: OpenAI Strikes Back
Full GPT-5.2 breakdown - did OpenAI reclaim the crown? A story of tokens, time and cost, plus 9 details you wouldn’t get just from reading the headlines.
https://www.youtube.com/@eightythousandhours
AI Insiders ($9!): https://www.patreon.com/AIExplained
https://lmcouncil.ai
Chapters:
00:00 - Introduction
00:55 - Better than Human @ Professional Tasks?
04:42 - Test-Time Compute
07:05 - Benchmark Selection
09:32 - Simple Results + council comparison
13:01 - Long Context
13:52 - Self-Improvement
15:00 - 10 Years + New Models
Release Page: https://openai.com/index/introducing-gpt-5-2/
GPT 5.2 Benchmark Comparison: https://www.reddit.com/r/singularity/comments/1pka1y9/gpt52_all_20_benchmarks_rankings_and_pricing/
https://storage.googleapis.com/gweb-uniblog-publish-prod/original_images/gemini_3_table_final_HLE_Tools_on.gif
https://lmcouncil.ai/benchmarks
Charxiv: https://charxiv.github.io/#leaderboard
GDPval: https://arxiv.org/pdf/2510.04374
My vid: https://www.youtube.com/watch?v=oK5LxMaROSA
Kilpatrick: https://x.com/OfficialLoganK/status/1999270402712023158/photo/1
Noam Brown: https://x.com/polynoamial/status/1999189845164667132
New Model in New Year: https://www.theinformation.com/articles/openai-developing-garlic-model-counter-googles-recent-gains?rc=sy0ihq
10 Years of OpenAI: https://openai.com/index/ten-years/
GPQA: https://x.com/idavidrein/status/1841265634170278063
ARC-AGI 1-2: https://arcprize.org/arc-agi/2/
Sunday Robotics: https://x.com/tonyzzhao/status/1991204839578300813
Non-hype Newsletter: https://signaltonoise.beehiiv.com/