Agents
Agents
Agents
Vision Language Model / Multimodal Large Language Model
Introduction to LLM
Vision-Language-Action model
Reinforcement-Learning for Robot Dexity
Reinforcement-Learning Gym for Robot
Reinforcement Learning
Text-to-Music, Text-to-Song
Image-to-Video, Text-to-Video, Audio-to-Video
Text-to-Image, Text-to-3D, Image-to-3D
Text-to-Speech, Voice Cloning, Speech Seperation, ASR
Style Transfer, Variational AutoEncoder, Generative Adversarial Network (生成對抗網路)
Recurrent Neural Networks (遞迴神經網路)
Face Datasets, Face Detection, Face Alignment, Face Landmark, Face Recognition, Face Identificatio
Human-Pose, Head-Pose, Hand-Pose, Object-Pose Estimation (姿態估算)
Image Matting, Semantics Segmentation, Human Part Segmentation, Instance Segmentation, Video Object Segmentation, Panopitc Segmentation.
Object Detection (物件偵測)
Image Classification (影像分類)
Convolutional Neural Networks (卷積層神經網路)
Image Processing (影像處理)
Colab, Notepad++, Git-for-Windows, Python3-for-Windows, LLM & ComfyUI
AI-chips, Edge-AI MCUs
AI簡介:演進, 應用, 新聞, 影響, 未來