Cinema Studio
Generate talking-head videos with AI lip-sync, face enhancement, and cinematic post-processing.
Three-phase pipeline
Phase 1Identity LockComplete
Generate persona face, extract embeddings, lock identity across all frames
Phase 2Talking HeadIn progress
Voice synthesis (ElevenLabs) + lip-sync (MuseTalk) + face enhancement (GFPGAN)
Phase 3Cinematic ScenesPlanned
Multi-persona shots, camera movements, audio mixing, caption burn-in
Quality presets
Cinema Studio supports 6 quality presets that control face enhancement, upscaling, encoding, and bitrate:
PresetFPSEnhanceCRFUse case
realtime25No28Live avatars
draft25No26Quick previews
standard30GFPGAN23General use
high30GFPGAN + 2x20Important content
pixar30GFPGAN + color17Studio quality
cinema30GFPGAN + 4x14Theatrical
Technology stack
Voice SynthesisElevenLabs (eleven_turbo_v2_5)
Lip-SyncMuseTalk 1.5 (Tencent) — latent inpainting + VAE + Whisper
Face EnhancementGFPGAN + Real-ESRGAN
Video EncodingFFmpeg with H.264/H.265
ComputeRunPod Serverless (GPU workers)
StorageCloudflare R2 + Vercel Blob