LLM Training

Model Training

Three Phases of Model Training

Blog: 解密 LLM 訓練三部曲:深入解析 SFT 與關鍵的 RLHF 技術

  • 第一階段 (Self-Supervised Pre-Training):Pre-trained LLM
  • 第二階段 (Supervised Fine-Tuning):SFT LLM
  • 第三階段 (Reinforcement Learning from Human Feedback):Reward Model 與 Final Model

Blog: RLHF: Reinforcement Learning from Human Feedback


Pre-Train & Alignment (SFT, RLHF)


Post-Training & Forgetting


Build a Large Language Model (From Scratch)

Book: 讓 AI 好好說話!從頭打造 LLM (大型語言模型) 實戰秘笈

Build A Reasoning Model (From Scratch)



This site was last updated December 09, 2025.