Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I along with others, years ago, have said that answers that don't address the question asked should be removed. In this case, malicious actors are using packages that are not related to the problem presented in the question. This will become more prevalent than package squatting.


Particularly if LLMs are being trained on this material.


I wonder if there's any weighting done based on things like upvotes in their training sets, or if the consider all answers equally?


I thought that was the entire ”value” of Twitter, Reddit, SO and other such platforms for LLM training?

It’s not like we haven’t got higher quality corpora, on average. They’re just poorly annotated.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: