LLM Training
Model Training
Three Phases of Model Training
Blog: Demystifying the LLM Training Trilogy: A Deep Dive into SFT and the Key RLHF Technique (解密 LLM 訓練三部曲:深入解析 SFT 與關鍵的 RLHF 技術)
- Phase 1 (Self-Supervised Pre-Training): Pre-trained LLM
- Phase 2 (Supervised Fine-Tuning): SFT LLM
- Phase 3 (Reinforcement Learning from Human Feedback): Reward Model and Final Model
Blog: RLHF: Reinforcement Learning from Human Feedback

Pre-Train & Alignment (SFT, RLHF)
Post-Training & Forgetting
Build a Large Language Model (From Scratch)
Book: Make AI Speak Properly! A Hands-On Guide to Building an LLM (Large Language Model) from Scratch (讓 AI 好好說話!從頭打造 LLM (大型語言模型) 實戰秘笈)

Build A Reasoning Model (From Scratch)
This site was last updated December 09, 2025.