Hacker News

Here's a grid of pelicans for the different models and reasoning levels: https://static.simonwillison.net/static/2026/gpt-5.4-pelican...


Surely this task must now be in the training data


If it is, and it works well, then it seems like mission accomplished and time for a new benchmark.


Nano medium must have been run when the servers were on fire


Thanks for the grid. The nano xhigh is my favorite pelican


Some of these are nightmare fuel. I love them.



