Avatar v2 — Technical Guide
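Turn any portrait photo into a talking video: upload a photo, then provide audio or type what the character should say, and the AI animates the face with natural motion and lip-sync.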
Avatar v2 — Available Models
Avatar Standard
Default
kling-v2-avatar
Natural lip-sync and expressive motion from portrait + audio.
Mode: std
Avatar Pro
kling-v2-avatar
Higher fidelity, smoother motion, improved expressivity.
Mode: pro
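As a minimal sketch of how the two variants relate: both listings share the model identifier kling-v2-avatar and differ only in mode (std or pro). The function name and dictionary keys below are hypothetical illustrations, not a documented request format.

```python
# Values taken from the model listing above.
MODEL_ID = "kling-v2-avatar"
MODES = {"standard": "std", "pro": "pro"}


def build_avatar_request(variant: str = "standard") -> dict:
    """Return a hypothetical request fragment for the chosen variant.

    The keys "model" and "mode" are assumptions made for illustration.
    Avatar Standard is treated as the default, as marked in the listing.
    """
    if variant not in MODES:
        raise ValueError(f"unknown variant {variant!r}; expected one of {sorted(MODES)}")
    return {"model": MODEL_ID, "mode": MODES[variant]}
```

Calling build_avatar_request("pro") changes only the mode value; the model identifier stays the same.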
You Give: Character Photo, Audio (TTS or Upload), Expression Prompt
AI Magic: klingai
You Get: Video
Quality modes
TTS emotions
😐 Neutral
😊 Happy
😠 Angry
😢 Sad
😨 Fearful
🤢 Disgusted
😲 Surprised
Max duration: 5 min
Audio source: Upload (MP3/WAV/M4A) or TTS
TTS languages: English, Chinese
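The limits listed on this page can be checked before submitting a job. The sketch below is one way to do that, under stated assumptions: the function names are hypothetical, the 5-minute limit is treated as inclusive and applied to uploaded audio only, and the audio duration is assumed to be known already (no file is opened or decoded).

```python
from pathlib import Path

# Limits as stated on this page.
MAX_DURATION_SECONDS = 5 * 60
UPLOAD_FORMATS = {".mp3", ".wav", ".m4a"}
TTS_LANGUAGES = {"english", "chinese"}
TTS_EMOTIONS = {"neutral", "happy", "angry", "sad", "fearful", "disgusted", "surprised"}


def check_upload(filename: str, duration_seconds: float) -> list[str]:
    """Return a list of problems with an uploaded audio file (empty if none)."""
    problems = []
    if Path(filename).suffix.lower() not in UPLOAD_FORMATS:
        problems.append("unsupported audio format; expected MP3, WAV, or M4A")
    if duration_seconds > MAX_DURATION_SECONDS:
        problems.append("audio exceeds the 5-minute maximum")
    return problems


def check_tts(language: str, emotion: str = "neutral") -> list[str]:
    """Return a list of problems with TTS settings (empty if none)."""
    problems = []
    if language.lower() not in TTS_LANGUAGES:
        problems.append("unsupported TTS language; expected English or Chinese")
    if emotion.lower() not in TTS_EMOTIONS:
        problems.append("unknown TTS emotion")
    return problems
```

Returning a list of problems rather than raising on the first one lets a caller report every issue at once, for example check_upload("voice.ogg", 400) reports both the format and the duration.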
💰 Avatar v2 — Pricing
Failed jobs are automatically refunded
Avatar 2.0 lets you upload a character image, add a voiceover, and describe the character's expressions to generate a lifelike, dynamic avatar video. The upgraded version supports content up to 5 minutes long.