arXiv AI recent: CineOrchestra: Unified Entity-Centric Conditioning for Cinematic Video Generation
Researchers introduced CineOrchestra, a unified video diffusion model that simultaneously controls subjects, events, cameras, and shot transitions in cinematic video generation.
CineOrchestra treats heterogeneous cinematic elements as entities acting over specific temporal intervals and expresses them through a shared structure of entity‑centric conditioning primitives, augmented with reference images for visual entities. The model solves the positional encoding challeng...