Reinforcement
an archive of posts with this tag
Oct 15, 2023 | Understanding Phenomenal REINFORCE Policy Gradient Method |
---|---|
Jul 28, 2023 | Reinforcement Learning from Human Feedback (RLHF) Presentation Slides |
Jul 12, 2023 | GPT-4 Presentation Slides |
May 12, 2023 | Proximal Policy Optimization (PPO) Presentation Slides |