Hacker News
DeepSeek's Dualpath Paper explained with animations (mesuvash.github.io)
2 points by mesuvash 49 days ago
Reinforcement Learning for LLMs (mesuvash.github.io)
2 points by gmays 51 days ago
Intuitive Intro to Reinforcement Learning for LLMs (mesuvash.github.io)
3 points by mesuvash 56 days ago
An Intuitive Introduction to PPO and GRPO (mesuvash.github.io)
5 points by mesuvash 58 days ago | 2 comments
Hashing for large-scale similarity (mesuvash.github.io)
57 points by suphyr on Feb 11, 2019 | 5 comments
