← back to notes
NLP / LLMs / Agents
Reinforcement Fine-Tuning
draft
Not written yet.
This page is a placeholder. Reinforcement Fine-Tuning is on the list of things I want to write a working note about — once I have something to say that isn’t already in a textbook.
If you’d like to nudge me toward writing this one specifically, send a note to videetnimsarkar21@gmail.com.