RL00 - A glimpse of Reinforcement Learning
This post summarizes reinforcement learning from classic tabular methods to ML-based approximations and recent LLM applications like RLHF.
This post summarizes reinforcement learning from classic tabular methods to ML-based approximations and recent LLM applications like RLHF.