PERSTECHTIVES
Vibes, benchmarks and the “evaluation crisis” of intelligence
Martin Signoux
Mar 4
GPT-4.5 reminded us of a simple truth: evaluating intelligence is hard.