To add to the excellent resources that have already been posted, Chapter 9 of Jurafsky and Martin's "Speech and Language Processing" has a nice overview of attention, and the next chapter talks specifically about the Transformer architecture: https://web.stanford.edu/~jurafsky/slp3/

