A zero-shot method for fine-grained control in video editing, built on space-time attention modulation.
Last updated: 2025-03-02 18:58:34
VideoGrain is a method for fine-grained control over video content during editing. It addresses two problems in diffusion-based editing, semantic misalignment between text and regions and feature coupling between regions, by modulating space-time attention. The approach is zero-shot and supports editing at three levels of granularity: class, instance, and part.
Using VideoGrain involves modulating both cross-attention and self-attention within the diffusion model. In the cross-attention phase, local prompts are paired with their respective regions to improve text-to-region control. For self-attention, VideoGrain amplifies the attention within a region and reduces cross-region interference, allowing for greater feature separation and control over the video output.
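The two modulation steps described above can be illustrated with a small NumPy sketch. This is not VideoGrain's implementation: the additive-bias form, the `strength` value, and the names `modulate_self_attention`, `modulate_cross_attention`, `region_ids`, and `token_regions` are all assumptions made for illustration. Tokens are assumed to be flattened over space and time, with each token carrying an integer region label.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def modulate_self_attention(logits, region_ids, strength=2.0):
    """Hypothetical self-attention modulation.

    logits:     (N, N) query-key scores over N space-time tokens.
    region_ids: (N,) integer region label for each token.
    Raises scores for token pairs in the same region and lowers
    scores for pairs in different regions before the softmax.
    """
    same_region = region_ids[:, None] == region_ids[None, :]
    bias = np.where(same_region, strength, -strength)
    return softmax(logits + bias, axis=-1)

def modulate_cross_attention(logits, region_ids, token_regions, strength=2.0):
    """Hypothetical cross-attention modulation.

    logits:        (N, T) scores between N space-time tokens and T text tokens.
    token_regions: (T,) region each text token describes, or -1 if the
                   token is not tied to any region (left unmodified).
    Pairs each local prompt token with its own region.
    """
    match = region_ids[:, None] == token_regions[None, :]
    assigned = (token_regions >= 0)[None, :]
    bias = np.where(assigned, np.where(match, strength, -strength), 0.0)
    return softmax(logits + bias, axis=-1)

# Toy example: 4 tokens, two regions; 3 text tokens (one unassigned).
region_ids = np.array([0, 0, 1, 1])
token_regions = np.array([0, 1, -1])
self_attn = modulate_self_attention(np.zeros((4, 4)), region_ids)
cross_attn = modulate_cross_attention(np.zeros((4, 3)), region_ids, token_regions)
```

With uniform input scores, each token in `self_attn` places most of its attention on tokens in its own region, and each token in `cross_attn` attends most strongly to the text token assigned to its region. A larger `strength` separates regions more sharply, at the cost of ignoring the original scores.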
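VideoGrain is currently available as a free, experimental release. A commercial version may follow; see official updates for pricing details.
VideoGrain was developed jointly by the ReLER Lab at the University of Technology Sydney and CCAI at Zhejiang University.
No dedicated contact email is currently provided; for more information, use the lab and institute websites.
Social media: Twitter: @knightyxp, Instagram: @knightyxp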