NLP / LLMs / Agents

Reinforcement Fine-Tuning

draft

Not written yet.

This page is a placeholder. Reinforcement Fine-Tuning is on the list of things I want to write a working note about — once I have something to say that isn’t already in a textbook.

If you’d like to nudge me toward writing this one specifically, send a note to videetnimsarkar21@gmail.com.

← browse the rest of the notebook