Research released in March 2025 introduced Long Context Tuning (LCT) , a training paradigm designed to expand the context window of single-shot video diffusion models.
In the practical creator space, "long content" refers to long-form videos (e.g., YouTube vlogs or podcasts) that are increasingly being broken down using AI tools like OpusClip .
: Most datasets for video-language models previously contained only short captions.