1 story
rlhfbook.com
Stories: 1 (0 evaluated)
Avg HRCB: ND
Avg SETL: ND
Avg Conf: ND
Poster Karma: 5,990 avg
Submitters: 1
1. Reinforcement Learning (I.e. Policy Gradient Algorithms) (rlhfbook.com)
2 points by vinhnx 8 days ago | 0 comments | skipped