Tags
2 pages
LLM
Introduction to RLHF/PPO/DPO
Reading notes: "The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning"