A lot of comments here are dismissing this post because the relevant code was is...

felipeerias · 2026-04-12T02:42:30 1775961750

From the article:

> Our tests gave models the vulnerable function directly, often with contextual hints (e.g., "consider wraparound behavior").

grandinquistor · 2026-04-12T04:18:25 1775967505

I mean you can still scale that? Ask a lighter model to go through every function to find vulnerabilities, take output to bigger model like Opus and classify the critical ones.

make_it_sure · 2026-04-12T01:04:04 1775955844

check other comments, they didn't