Lecture

LLM Training

13 Sep 2025

Model Training

Three Phases of Model Training

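Blog: Demystifying the LLM Training Trilogy: An In-Depth Look at SFT and the Key RLHF Technique (original title: 解密 LLM 訓練三部曲：深入解析 SFT 與關鍵的 RLHF 技術)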

Phase 1 (Self-Supervised Pre-Training): Pre-trained LLM
Phase 2 (Supervised Fine-Tuning): SFT LLM
Phase 3 (Reinforcement Learning from Human Feedback): Reward Model and Final Model
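
As a rough illustration of how the three phases listed above differ, the sketch below writes out the loss each one commonly optimizes, in plain Python. It is not taken from the linked blog. The function names `next_token_loss` and `reward_pair_loss` are hypothetical, the prompt-masking convention for SFT is an assumption (a common choice, not the only one), and the final reinforcement learning step that produces the final model is not shown.

```python
import math

def next_token_loss(token_probs, mask=None):
    """Average negative log-likelihood of the correct next tokens.

    token_probs[i]: probability the model gave to the correct token at position i.
    mask[i]: 1 if position i counts toward the loss, 0 otherwise.
    """
    if mask is None:
        mask = [1] * len(token_probs)  # pre-training style: every token counts
    total = sum(-math.log(p) * m for p, m in zip(token_probs, mask))
    return total / sum(mask)

def reward_pair_loss(reward_chosen, reward_rejected):
    """Pairwise (Bradley-Terry style) loss: -log sigmoid(r_chosen - r_rejected)."""
    diff = reward_chosen - reward_rejected
    return math.log1p(math.exp(-diff))

# Phase 1 (pre-training): next-token loss over all tokens of raw text.
pretrain = next_token_loss([0.5, 0.25, 0.5])

# Phase 2 (SFT): same loss, here assumed to be masked to the response tokens
# so the prompt tokens (mask 0) do not contribute.
sft = next_token_loss([0.1, 0.1, 0.5, 0.5], mask=[0, 0, 1, 1])

# Phase 3 (RLHF, reward model): the human-preferred answer should score higher.
rm_good = reward_pair_loss(reward_chosen=2.0, reward_rejected=0.0)
rm_bad = reward_pair_loss(reward_chosen=0.0, reward_rejected=2.0)
```

The reward loss shrinks as the preferred answer's score pulls ahead of the rejected one, which is what lets the trained reward model stand in for human raters when the final model is optimized.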

Blog: RLHF: Reinforcement Learning from Human Feedback

Pre-Train & Alignment (SFT, RLHF)

Post-Training & Forgetting

Build a Large Language Model (From Scratch)

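Book: Make AI Speak Well! A Hands-On Guide to Building an LLM (Large Language Model) from Scratch (original title: 讓 AI 好好說話！從頭打造 LLM (大型語言模型) 實戰秘笈)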

Build A Reasoning Model (From Scratch)

This site was last updated December 09, 2025.
