What of it? For me too, it was around that time last year, with GPT-5, Claude So...

kmaitreys · 2026-04-17T04:28:43 1776400123

I think there's a lot of difference between sounding like someone and being someone. The models are excellent at pretending indeed.

falcor84 · 2026-04-17T13:01:43 1776430903

I don't think that sama was arguing that ChatGPT actually passed a PhD thesis defense. But arguably, it could make for an interesting benchmark.

kmaitreys · 2026-04-17T14:19:30 1776435570

Please do not get swayed by nor defend the words vomited by a snake oil salesman.

Also what benchmark? How will you you design it?

0123456789ABCDE · 2026-04-17T04:54:47 1776401687

exactly. this is what whole RL thing is optimizing for, even if that is not the intent.