Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The author's battle is that it doesn't solve the class problem correctly 100% of the time like a one off hack does, not that it does nothing for it at all. https://i.imgur.com/Ar3rlJ1.png


This was the case with 4o as well. It sometimes solved it, sometimes didn't.


One could say that down to a markov chain with noise, the perspective is the new model named after the problem solves it significantly more reliably than the previous without a problem specific hack.

It's also worth noting the current model is the lower scoring o1-preview, not o1.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: