Tags
2 pages
大模型对齐
RLHF/PPO/DPO介绍
阅读笔记:《THE UNLOCKING SPELL ON BASE LLMS RETHINKING ALIGNMENT VIA IN-CONTEXT LEARNING》