Reinforcement
an archive of posts with this tag
| Date | Post |
|---|---|
| Oct 15, 2023 | Understanding Phenomenal REINFORCE Policy Gradient Method |
| Jul 28, 2023 | Reinforcement Learning from Human Feedback (RLHF) Presentation Slides |
| Jul 12, 2023 | GPT-4 Presentation Slides |
| May 12, 2023 | Proximal Policy Optimization (PPO) Presentation Slides |