The author's battle is that it doesn't solve the class problem correctly 100% of...

paxys · on Sept 13, 2024

This was the case with 4o as well. It sometimes solved it, sometimes didn't.

zamadatix · on Sept 13, 2024

One could say the same all the way down to a Markov chain with noise. The point is that the new model, named after the problem, solves it significantly more reliably than the previous one, and without a problem-specific hack.

It's also worth noting that the current model is the lower-scoring o1-preview, not o1.