VLA
Vision-Language-Action model
Vision-Language-Action model
Reinforcement-Learning for Robot
Reinforcement Learning
Agent, MCP, Skills, OpenCode, OpenClaw, Hermes-Agent, Agent-OS
Sparse-AutoEncoder, Transcoder
Model FineTuning, Intelligence Benchmarks
Vision Language Model / Multimodal Large Language Model
Large Language Model
Text-to-Music, Text-to-Song
Image-to-Video, Text-to-Video, World-Model
Text-to-Image, Text-to-3D, Image-to-3D
Text-to-Speech, Voice Cloning, Speech Seperation, ASR
Style Transfer, Variational AutoEncoder, Generative Adversarial Network (生成對抗網路)
Recurrent Neural Networks (遞迴神經網路)
Face Datasets, Face Detection, Face Alignment, Face Landmark, Face Recognition, Face Identificatio
Human-Pose, Head-Pose, Hand-Pose, Object-Pose Estimation (姿態估算)
Image Matting, Semantics Segmentation, Human Part Segmentation, Instance Segmentation, Video Object Segmentation, Panopitc Segmentation.
Object Detection, Object Tracking
CNN, Image Classification
AI-chips, Edge-MCUs
AI簡介:演進, 應用, 新聞, 影響及趨勢, 未來
Image Processing (影像處理)