The AI Evaluation Substack
2025 December "AI Evaluation" Digest
Call for Tributes: Your test of time.
Dec 26, 2025 • AI Evaluation
2025 November "AI Evaluation" Digest
Hitting a wall? Seeing is all you need
Nov 28, 2025 • AI Evaluation
2025 October "AI Evaluation" Digest
"Beware; for I am fearless, and therefore powerful."
Oct 31, 2025 • AI Evaluation
Is the Definition of AGI a Percentage?
Zachary Tidler, Marko Tešić, Lorenzo Pacchiardi, John Burden, Lexin Zhou, Manuel Cebrián, Fernando Martínez-Plumed, Jose Hernandez-Orallo
Oct 31, 2025 • Lorenzo Pacchiardi, Lexin Zhou, Manuel Cebrian, Fernando Martínez-Plumed, Jose H. Orallo, Zack Tidler, and Marko Tesic
2025 September "AI Evaluation" Digest
What could possibly go wrong?
Sep 26, 2025 • AI Evaluation
2025 August "AI Evaluation" Digest
Between a rock and a hard place
Aug 29, 2025 • AI Evaluation
2025 July "AI Evaluation" Digest
Long live OpenML!
Jul 25, 2025 • AI Evaluation
2025 June "AI Evaluation" Digest
Illusion is all you need
Jun 27, 2025 • AI Evaluation
The AI Evaluation Substack
A monthly digest of the latest developments, research trends and key initiatives in the realm of AI evaluation.