Download "OpenAI just dropped GPT-5.2... (WOAH)"

Video tags

ai
llm
artificial intelligence
large language model
openai
mistral
chatgpt
ai news
claude
anthropic
apple ai
apple intelligence
llama
meta ai
google ai
Subtitles

00:00:00
GPT 5.2 is here and yes, it is an
00:00:02
incredible model. OpenAI published so
00:00:05
much information about it. We got a ton
00:00:06
of benchmarks. We have a bunch of demos
00:00:08
that I'm going to show you. So, let's
00:00:10
get right into it. But first, Flavio
00:00:11
Adamo did his famous bouncy balls in a
00:00:14
hexagon test and he really took it to
00:00:16
another level. Look at this. We now have
00:00:18
a 3D realistic looking hexagon. 3D
00:00:22
balls. They bounce off of each other.
00:00:24
The physics look real. The lighting
00:00:26
looks real. When they hit each other,
00:00:28
they brighten up for a moment. It is
00:00:31
incredibly impressive. Here's a test
00:00:33
that Ethan Mollick put together. Create a
00:00:35
visually interesting shader that can run
00:00:37
in twigl.app. Make it like an infinite
00:00:40
city of Neo Gothic towers partially
00:00:42
drowned in a stormy ocean with large
00:00:44
waves. Single shot. Look at this.
00:00:47
Extremely impressive. The water physics
00:00:49
looks real, although it looks low
00:00:51
polygon, but that's okay. The buildings
00:00:54
look incredible. And yes, it is
00:00:56
infinite. And of course, I know you all
00:00:58
want to see what are the benchmarks and
00:01:00
how does it compare to the other
00:01:02
frontier models. So, SWE-Bench Pro, a big
00:01:05
5% jump from 5.1 to 5.2 and now GPT 5.2
00:01:09
thinking is state-of-the-art. It is the
00:01:11
best model on the planet for the
00:01:13
SWE-Bench Pro benchmark. We have GPQA
00:01:17
Diamond, which is a science benchmark
00:01:19
with no tools. 92.4%
00:01:23
that is a 4% bump from 5.1 thinking. And
00:01:26
yes, once again, state-of-the-art. We
00:01:29
have a new 100%, aced it, AIME 2025
00:01:35
benchmark. Completely aced it. Got every
00:01:37
single question right. AIME 2025 is a
00:01:40
math competition. Very difficult math
00:01:43
problems. And 5.2 aced it. That's
00:01:46
compared to 95% with Gemini 3 Pro and
00:01:48
92.8 with Claude Opus 4.5. Now, here is
00:01:52
the big one. Here is the surprising one.
00:01:55
ARC AGI 2. It went from 17% on 5.1
00:01:59
thinking to 52.9%.
00:02:02
That is a stunning increase and is by
00:02:06
far state-of-the-art for ARC AGI2. And
00:02:08
if you're not familiar with that
00:02:09
benchmark, it is a benchmark from our
00:02:11
friends at ARC Prize and tests the
00:02:13
model's ability to learn and generalize.
00:02:17
Probably the truest definition of AGI.
00:02:20
And even ARC Prize posted about it. Look
00:02:23
at this. We verified GPT 5.2 Pro High is
00:02:26
state-of-the-art for ARC AGI2 scoring
00:02:29
54.2% at $15.72
00:02:33
per task. That is extremely
00:02:35
cost-efficient. And what's even more
00:02:38
impressive is the efficiency improvement
00:02:40
as compared to just a year ago. Listen
00:02:43
to this. A year ago, we verified a
00:02:45
preview of an unreleased version of o3
00:02:47
High that scored 88% at an estimated
00:02:49
cost of $4,500
00:02:52
per task. Today, we verified GPT 5.2 Pro
00:02:56
X-High, 90.5%, so a better score at $11.
00:03:02
$11 per task. That is down from $4,500
00:03:07
per task. That is a 390x
00:03:10
efficiency improvement. It is not just
00:03:13
about getting the best score. It is also
00:03:16
about efficiency per token. And yes,
00:03:19
both ARC AGI 1 and ARC AGI 2 for GPT 5.2
00:03:23
thinking are both state-of-the-art as compared
00:03:26
to the other frontier models, Opus 4.5
00:03:29
and Gemini 3 Pro. Last, GDPval, which
00:03:32
tests real world knowledge task
00:03:35
evaluations. And once again 5.2 is
00:03:40
dominating at a 70.9%
00:03:43
as compared to second place Opus 4.5 at
00:03:46
59.6%.
00:03:48
Incredibly impressive scores. And by the
00:03:50
way, I've started including some
00:03:52
mentions of prediction markets like Kalshi
00:03:54
and Polymarket in my videos. Let me
00:03:56
know if you like those in the comments
00:03:57
below. I find them pretty interesting.
00:03:59
So this prediction says which AI company
00:04:01
will have the best coding model on
00:04:02
January 1st, 2026. And if we look all
00:04:05
the way up until about December 5th or
00:04:08
6th, Anthropic was by far in the lead
00:04:11
and then all of a sudden it switched. So
00:04:13
definitely seems like someone knows
00:04:15
something. But as of right now, there's
00:04:17
an 86% chance that OpenAI will have the
00:04:20
best coding model on January 1st, 2026.
00:04:24
And we're doing a major giveaway. Let me
00:04:27
tell you about it. We just launched the
00:04:29
high taste AI bundle with Zapier,
00:04:32
GitHub, Microsoft Copilot, Windsurf,
00:04:34
Replit, CrewAI, Gemini, and so many
00:04:38
more. There are over $15,000 in total
00:04:41
prizes up for grabs. All you need to do
00:04:43
is go sign up for our newsletter, share
00:04:46
it with your friends, and the more
00:04:48
referrals that you get, the higher your
00:04:50
chances of winning. The grand prize
00:04:51
winner wins over $12,000 in credits,
00:04:54
seats, and pro plans from all the
00:04:57
companies that I mentioned. We're going
00:04:58
to choose a winner on December 26. So,
00:05:00
make sure you enter. Links down below.
00:05:03
Now, to the blog post released by OpenAI
00:05:06
for GPT 5.2. They are putting a lot of
00:05:08
emphasis on GPT 5.2 being incredibly
00:05:10
good and a big improvement from 5.1 on
00:05:14
real world tasks, economically valuable
00:05:16
tasks. All right, so who gets it and
00:05:18
when? Well, 5.2 is available right
00:05:20
away for paid users. That includes the
00:05:23
instant, the thinking, and the pro
00:05:25
versions of 5.2 for all paid accounts.
00:05:29
Let me show you some examples of
00:05:31
economically valuable work that GPT 5.2
00:05:33
is able to do much better than 5.1.
00:05:36
First, workforce planner. The prompt
00:05:39
create a workforce planning model,
00:05:40
headcount, hiring plan, attrition, and
00:05:43
budget impact include engineering,
00:05:44
marketing, legal, and sales departments.
00:05:46
on the left. This is 5.1 thinking. And
00:05:49
as you can see, it is a very basic
00:05:51
looking Excel, which actually to be fair
00:05:54
is fine. You don't really need it to be
00:05:56
very pretty. It just needs to have the
00:05:58
right data. But if you look at the right
00:05:59
side, it was able to format the Excel in
00:06:02
a much more easily readable way. Here's
00:06:05
a cap table management Excel document.
00:06:08
On the left, we have 5.1 thinking. On
00:06:10
the right, we have 5.2 thinking. And the
00:06:12
interesting thing to note is 5.1
00:06:15
incorrectly calculated seed series A and
00:06:17
series B liquidation preferences. By the
00:06:20
way, these are very sophisticated
00:06:22
spreadsheets and specifically formulas.
00:06:25
5.1 incorrectly calculated seed series A
00:06:28
and series B liquidation preferences and
00:06:30
left majority of those rows blank,
00:06:32
leading to an incorrect final equity
00:06:33
payout calculation. This is a big deal.
00:06:36
Being able to trust an AI model to
00:06:38
create these cap tables is extremely
00:06:41
valuable, but only if they get it right.
00:06:44
The cost of getting it wrong is
00:06:46
enormous. Potentially millions and
00:06:48
millions of dollars, potentially
00:06:50
billions of dollars. And obviously, a
00:06:51
human is ultimately going to need to
00:06:53
review this and sign off on it, but it
00:06:55
is still incredibly important to get it
00:06:57
right. And GPT 5.2 gets it all right.
00:07:00
Here's another example. You are a
00:07:02
project manager at a UK-based tech
00:07:04
startup called BridgeMind. BridgeMind
00:07:06
successfully obtained grant funding from
00:07:07
a UK-based organization that supports
00:07:09
the development of AI tools to help
00:07:12
local businesses. This website provides
00:07:14
some background information. So, I won't
00:07:16
read the whole thing, but it basically
00:07:17
gives it a bunch of information and asks
00:07:19
it to create a report. And so, what we
00:07:21
see on the left side here, this is one
00:07:23
of the slides, the report on the right
00:07:25
side, as you can tell, much more
00:07:27
beautiful, much more easily readable.
00:07:29
And so, it definitely did a much better
00:07:31
job. Now, I'm looking right here, and
00:07:33
this is a tiny minor gripe, but you can
00:07:36
see that we have this rounded edge right
00:07:38
here. But for the header, we have this
00:07:40
squared off, and they're overlapping.
00:07:42
Very minor, very subtle visual detail
00:07:45
that it got wrong. Next, let me show you
00:07:47
maybe one of the most impressive demos
00:07:50
I've seen to date for a coding prompt.
00:07:52
First, here's the prompt. Create a
00:07:54
single page app in a single HTML file
00:07:56
with the following requirements. Name:
00:07:58
ocean wave simulation goal. Display
00:08:00
realistic animated waves. Features:
00:08:03
Change wind speed, wave height,
00:08:04
lighting. The UI should be calming and
00:08:06
realistic. And look at that. It looks
00:08:09
phenomenal. I don't really understand
00:08:11
why it looks like you're looking through
00:08:13
a port hole or something, but you know,
00:08:15
that's fine. But if we change the wind
00:08:18
speed, we can see if we turn it all the
00:08:19
way down, it's very calm waves. And of
00:08:21
course, if we turn it all the way up, we
00:08:23
have much more turbulent waves. We can
00:08:25
also change the wave height just like
00:08:27
so. So now we have really big waves. And
00:08:30
then all of a sudden, we can move it
00:08:31
down to the bottom and have much smaller
00:08:33
waves. Let's turn the wind speed all the
00:08:35
way up. This is the calmest ocean you can
00:08:36
possibly think of. And then of course on
00:08:39
the other side, there we go. And you can
00:08:40
also change the lighting like so. And
00:08:42
GPT 5.2 also hallucinates less than 5.1.
00:08:47
It's a minor improvement, but still very
00:08:49
meaningful. Any squashing of
00:08:50
hallucinations is definitely worth
00:08:52
noting. GPT 5.1 thinking with at least
00:08:56
one error in its response. We'll call
00:08:58
those hallucinations. GPT 5.2 thinking
00:09:00
6.2%.
00:09:02
A very welcomed improvement. It also
00:09:04
sets a new standard in long context
00:09:06
reasoning. So, we get 256K tokens just
00:09:10
like we did in 5.1 in previous models.
00:09:12
So, no big expansion of the context
00:09:14
window there. But what it can actually
00:09:16
do with the context window has greatly
00:09:18
improved. Look at this. So, as we see,
00:09:20
this is 5.1 thinking, and this is the
00:09:23
MRCRv2 benchmark with four needles, very
00:09:26
similar to the needle in the haystack
00:09:28
test. And as we can see for 5.1
00:09:31
thinking, it drops all the way down to
00:09:33
42% at 256K tokens. Whereas 5.2 is nearly
00:09:38
perfect at 98% for 256K tokens with four
00:09:42
needles in the haystack. Now, when we
00:09:44
have eight needles, it dropped off a
00:09:46
little bit more, coming down to 70% as
00:09:49
compared to 30% for 5.1. It's also much
00:09:52
better at visual reasoning, which is a
00:09:55
very important feature, something that I
00:09:56
use all the time. It cuts error rates
00:09:58
roughly in half on chart reasoning and
00:10:00
software interface understanding. Here's
00:10:02
the CharXiv reasoning, scientific
00:10:04
figure questions, benchmark, and 5.1
00:10:07
thinking, 80%, 5.2 thinking, 88%. So, a
00:10:10
nice improvement there. And a quick
00:10:12
thanks to Dell Technologies for
00:10:14
sponsoring this portion of the video.
00:10:15
Dell Technologies has a family of
00:10:17
incredible laptops called the Dell Pro
00:10:19
Max featuring Nvidia RTX Pro Blackwell
00:10:22
chips, which are portable AI
00:10:25
workhorses. It comes in 14- and 16-inch screen
00:10:28
sizes and up to 32 GB of GPU memory.
00:10:31
Perfect for on-the-go AI workloads. Check
00:10:34
them out. Link in the description below.
00:10:36
And of course, for computer use, we need it
00:10:39
to be able to understand what's on the
00:10:41
screen specifically user interfaces to
00:10:43
know where the buttons are how to click
00:10:45
them and 5.2 is much improved over 5.1.
00:10:49
So for the ScreenSpot-Pro GUI
00:10:52
screenshot understanding 64% for 5.1
00:10:55
thinking and 86% for 5.2. And here's
00:10:58
another example of just how good it is
00:11:00
at visual reasoning. Here's a picture of
00:11:02
a motherboard. 5.1 it was asked to
00:11:04
identify all the different elements of
00:11:06
it. And as you can see, not very good,
00:11:08
not accurate on the actual boxing and
00:11:12
only identified four elements of it
00:11:14
versus 5.2
00:11:17
really did much better, identified many
00:11:19
more ports and chips and RAM and boxed
00:11:22
it much more accurately. And for tool
00:11:24
use, we also saw a great improvement.
00:11:27
Here's Tau2-bench for telecom use cases.
00:11:29
This is customer support. nearly perfect
00:11:32
98.7 for 5.2 thinking versus here's 5.1
00:11:36
thinking, 47%. Double, literally double. And
00:11:40
so what that actually means is 5.2 is
00:11:43
much better at tool calling and long
00:11:45
chains of tool calling. Here's an
00:11:47
example. My flight from Paris to New
00:11:48
York was delayed and I missed my
00:11:50
connection to Austin. My checked bag is
00:11:53
also missing and I need to spend the
00:11:54
night in New York. I also require a
00:11:57
special front row seat for medical
00:11:58
reasons. Can you help me? 5.1, it was
00:12:01
only able to do tool calling over a
00:12:03
handful of different iterations versus
00:12:05
5.2, look how many more it was able to
00:12:07
do. It is also much better at mental
00:12:09
health evaluations, which is equally as
00:12:12
important as everything else we're
00:12:14
discussing. When people use these
00:12:16
models, we need to know that they are
00:12:17
safe while using them. So, let's talk
00:12:19
about pricing. It is more expensive,
00:12:22
unfortunately. I thought a version bump
00:12:24
would be the same price, but it isn't.
00:12:26
You do get a lot more for it, but still
00:12:28
it is a pretty meaningful price
00:12:30
increase. So, GPT 5.1 per million input
00:12:34
tokens, $1.25
00:12:37
versus $1.75
00:12:39
for 5.2. That is a big increase. And $10
00:12:42
per million output for 5.1 versus $14 per
00:12:46
million output for 5.2. And last,
00:12:48
LMArena published some preliminary
00:12:51
results. And GPT 5.2 High is coming
00:12:55
in at an ELO score of 1486, putting it
00:12:58
at number two overall, coming in just
00:13:01
under Opus 4.5, which is still the
00:13:04
number one coding model in the world
00:13:06
according to LMArena. So, I'll drop a
00:13:08
link to this blog post down below. They
00:13:10
did put out a ton of different
00:13:12
benchmarks. I would have liked if they
00:13:15
included the other companies' frontier
00:13:17
models in all of these different
00:13:19
charts for the benchmarks, but hopefully
00:13:21
they do that next time. Box put out some
00:13:23
nice enterprise benchmarks. No, they're
00:13:25
not sponsoring this video, but I just
00:13:27
find this to be valuable. So, let me
00:13:28
show it to you. What we're seeing here
00:13:30
is time to first token for Box AI
00:13:32
Enterprise Eval. And so, lower is
00:13:34
better. And what we're seeing here is a
00:13:36
drastic decrease from 5 to 5.1 to 5.2
00:13:40
across the board. Long document complex
00:13:43
extraction, analytical query, and
00:13:45
multi-turn query. Then we're seeing
00:13:47
accuracy scores for those enterprise use
00:13:50
cases. And once again, 59 to 66. And as
00:13:53
you can see pretty much across the
00:13:55
board, a very nice improvement. Once
00:13:57
again, I would have liked to see how it
00:13:59
compares to other frontier models in the
00:14:01
same chart, but maybe next time. So
00:14:04
that's it for today. It seems like
00:14:05
pre-training is not slowing down. We
00:14:07
have not hit a wall. Very excited to see
00:14:09
these big improvements to the GPT5
00:14:12
series of models. If you enjoyed this
00:14:14
video, please consider giving a like and
00:14:15
subscribe.
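
The cost and pricing figures quoted in the transcript can be sanity-checked with a few lines of arithmetic. The sketch below is only illustrative: it uses the numbers as spoken in the video, assumes the GPT 5.1 input price is $1.25 per million tokens (the transcript's "$125" reads as a dropped decimal point), and uses the rounded $11 per task for ARC AGI 1. The helper names are made up for this example.

```python
# Sanity check of figures quoted in the transcript (all values in USD).
# Assumption: GPT 5.1 input price is $1.25 per million tokens.
# Assumption: the ARC AGI 1 cost uses the rounded "$11 per task" as spoken.

def ratio(old: float, new: float) -> float:
    """Return how many times larger `old` is than `new`."""
    return old / new


def percent_increase(old: float, new: float) -> float:
    """Return the increase from `old` to `new` as a percentage of `old`."""
    return (new - old) / old * 100


# ARC AGI 1: estimated cost per task a year ago vs. today.
arc_efficiency = ratio(4500.0, 11.0)

# API pricing per million tokens, GPT 5.1 -> GPT 5.2.
input_increase = percent_increase(1.25, 1.75)
output_increase = percent_increase(10.0, 14.0)

print(f"ARC AGI 1 cost ratio: ~{arc_efficiency:.0f}x")
print(f"Input price increase:  {input_increase:.0f}%")
print(f"Output price increase: {output_increase:.0f}%")
```

With the rounded $11 figure the cost ratio comes out a little above 400x, so the quoted 390x presumably rests on an unrounded per-task cost. Under the pricing assumption above, input and output prices both rise by 40%.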

Description:

Check out the Dell Pro Max Workstation with the NVIDIA RTX PRO! https://www.dell.com/en-us/lp/dt/nvidia-ai
Enter the AI Bundle Giveaway with over $15k in prizes!👇🏼 https://www.forwardfuture.ai/the-high-taste-ai-bundle
Download The Subtle Art of Not Being Replaced 👇🏼 https://www.forwardfuture.ai/one-hundred-ways-to-use-ai-lp
Download Humanities Last Prompt Engineering Guide 👇🏼 https://www.forwardfuture.ai/p/humanity-s-last-prompt-engineering-guide
Join My Newsletter for Regular AI Updates 👇🏼 https://www.forwardfuture.ai/
Discover The Best AI Tools👇🏼 https://tools.forwardfuture.ai/

My Links 🔗
👉🏻 X: https://x.com/matthewberman
👉🏻 Forward Future X: https://x.com/forward_future_
👉🏻 Instagram: https://www.facebook.com/unsupportedbrowser
👉🏻 Discord: https://discord.com/invite/xxysSXBxFW
👉🏻 TikTok: https://www.tiktok.com/@matthewberman_ai

Media/Sponsorship Inquiries ✅ https://crswhd1x01z.typeform.com/to/BX2fuHIe

Links:
https://openai.com/index/introducing-gpt-5-2/
https://x.com/flavioAd/status/1999183432203567339
https://x.com/arcprize/status/1999182732845547795
https://x.com/arena/status/1999183339283185878
https://blog.box.com/how-openais-gpt-52-delivers-lightning-fast-specialist-level-reasoning
