Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

And also the ones before that explain the attention mechanism:

https://youtu.be/wzfWHP6SXxY?t=4366

https://youtu.be/gKD7jPAdbpE (up to 25:42)



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: