Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The model is only generating tokens without touching the network at all, right? How would it send data away?


Theoretically, by taking the opportunity to inject an exfiltration mechanism if you ask it to write code for you


Lots of people I know run models in "yolo" mode or the equivalent as well, which means it could just invoke curl or telnet to exfiltrate data.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: