AI Brief

AI的演進

Artificial Intelligence (AI) Evolution


Deep Learning (深度學習)


AI 的應用

CNN

  • Image Classification (影像分類)
  • Object Detection (物件偵測)

  • Pose Estimation (姿態估計)
  • Face Recognition (人臉識別)

GAN

  • Image Inpainting (修圖)
  • Deep Fake (換臉)
  • Stable Diffusion (生成圖片/影片)

RNN

  • Text-to-Speech(TTS) : 文字轉語音
  • Text-to-Text : Translation (翻譯), Text Generation(文本產生)
  • Generative Pretrained Transformers (GPT) : Q&A, Exam (考題問答)
  • Large Language Model(LLM): ChatGPT, Gemini, Grok Dataset: GSM8K (Grade School Math)

AI Competitions & Jobs


Generative AI (生成式人工智慧)

The 55 Best AI Tools for 2025 (Tried and Tested)

LLMs Timeline

AI agents comparison


Microsoft WHAM

Introducing Muse: Our first generative AI model designed for gameplay ideation


Large Language Model (大型語言模型)

GPT-5

GPT‑5 的智慧水準全方位大幅提升,從各項學術及人類評量的基準測試中可見一斑,在數學、程式設計、視覺感知與健康領域的表現尤其突出。它在數學 (在 AIME 2025 未使用工具的情況下達到 94.6%)、實際程式設計 (SWE-bench Verified 達 74.9%、Aider-Polyglot 達 88%)、多模態理解 (MMMU 達 84.2%) 和健康 (HealthBench Hard 達 46.2%) 領域的基準測試中,全面刷新最高記錄,而這些進步就體現在日常使用情境中。運用 GPT‑5 Pro 的延伸推理能力,這款模型還在 GPQA 中創下 SOTA 新紀錄,在不使用輔助工具的狀態下取得 88.4% 高分。

Grok-4

  • 推理能力大幅提升:AIME 數學、GPQA 科學問答測試表現領先,擅長複雜問題解構。
  • 專用編碼模式(Grok 4 Code):現場展示即時編寫並執行 HTML 與 Python 程式。
  • 全面多模態互動:支援文字、圖像與語音輸入,新增即時圖像生成。
  • 即時資料檢索(RAG 架構):與 X 平台整合,即時獲取新聞、趨勢貼文進行回答。
  • Hybrid Transformer-MoE 架構:提升運算效率與任務專業化。
  • 超大規模訓練:使用 xAI Colossus 超級電腦,訓練資源達 25 萬顆 Nvidia H100 GPU。

Gemini-2.5

Introducing the Gemini 2.5 Computer Use model
Prompt: From https://tinyurl.com/pet-care-signup, get all details for any pet with a California residency and add them as a guest in my spa CRM at https://pet-luxe-spa.web.app/. Then, set up a follow up visit appointment with the specialist Anima Lavar for October 10th anytime after 8am. The reason for the visit is the same as their requested treatment.


Text-to-Image

  • Grok Image 0.9

Text-to-Video


AGI - Artificial General Intelligence (通用人工智慧)

AGI stands for Artificial General Intelligence. It’s a theoretical level of AI development where a machine can understand, learn, adapt, and implement knowledge across a wide range of tasks, much like a human being.
Paper: Levels of AGI: Operationalizing Progress on the Path to AGI
Paper: GAIA: a benchmark for General AI Assistants

  • 推理型 LLM 的出現,加速了對 AGI 到來的那一天的想像。
  • AGI 更像是一種「資源」,而非「工具」
  • AGI 將使公司更傾向於裁員並停止招聘新人,因為人類勞動力不再具有經濟價值。

LLM Reasoning

Reinforcement Pre-Training

Microsoft and China AI Research Possible Reinforcement Pre-Training Breakthrough


AI News

2025

2024


AI的影響


AI的未來

Elon Musk latest interview


CES 2025 Jenson Keynote

AI Ascent 2025

長文導讀紅杉資本給創業者的戰略建議:AI 如何成為下一個兆元經濟?


AI Enpowerment (賦能)


Humanoid Robots(人形機器人)

Optimus Gen3

Walker S2


Figure 03


ADAM (DeepMind’s RoboTool)

Paper: ADAM: a robotic companion for enhanced quality of life in aging populations


NVIDIA Isaac GR00T N1


WRC 2025



This site was last updated October 26, 2025.