Potemkin Understanding in Large Language Models

Created in March 23, 2026

2026

I was invited give a talk to OpenAI’s Economic Reserach team. The talk involved how to evaluate and benchmark large language models for depth of conceptual understanding. Here are the slides.

Enjoy Reading This Article?

Here are some more articles you might like to read next:

Google Gemini updates: Flash 1.5, Gemma 2 and Project Astra

Displaying External Posts on Your al-folio Blog

a post with tabs

a post with typograms

a post that can be cited