Reinforcement Learning @lemmy.ca

A Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambert

rlhfbook.com/book.pdf

https://bsky.app/profile/natolambert.bsky.social/post/3lh5jih226k2k

Anyone interested in learning about RLHF? This text isn't complete yet, but it already looks like a pretty useful resource as is.
