DynVFX: Augmenting Real Videoswith Dynamic Content

We present a method for augmenting real-world videos with newly generated dynamic content. Given an input video and a simple user-provided text instruction describing the desired content, our method synthesizes dynamic objects or complex scene effects that naturally interact with the existing scene over time. The position, appearance, and motion of the new content are seamlessly integrated into the original footage while accounting for camera motion, occlusions, and interactions with other dynamic objects in the scene, resulting in a cohesive and realistic output video. We achieve this via a zero-shot, training-free framework that harnesses a pre-trained transformer-based text-to-video diffusion model to synthesize the new content and a pre-trained Vision Language Model to envision the augmented scene in detail. Specifically, we introduce a novel sampling-based method that manipulates features within the attention mechanism, enabling accurate localization and seamless integration of the new content while preserving the integrity of the original scene. Our method is fully automated, requiring only a simple user instruction. We demonstrate its effectiveness on a wide range of edits applied to real-world videos, encompassing diverse objects and scenarios involving both camera and object motion. — Read More

#vfx

Deep Dive into LLMs like ChatGPT

Read More

#videos

How DeepSeek stacks up when citing news publishers

Over the past two weeks, DeepSeek has made a splash in the AI industry. On January 20, the Chinese startup released its new open source model, DeepSeek-R1, which beat competitors like OpenAI’s o1 on several important performance benchmarks, despite costing a fraction of the price to develop.

In the DeepSeek hype cycle, however, little attention has been paid to the company’s approach to news publishers. When it comes to the model’s high performance, it’s worth asking if that extends to the model’s ability to accurately cite and attribute its news sources. And while DeepSeek is turning heads by hurdling over cost barriers to train its foundation model, does that model actually consider the intellectual property of media companies?

… Last summer, I published a story showing that ChatGPT, OpenAI’s chatbot, regularly hallucinated URLs for at least 10 of its news partners’ websites.  … I conducted a similar round of tests with DeepSeek’s chatbot, using both its website and mobile app. I prompted the model to share details on dozens of original investigations by major news outlets and to share links to those stories. A few things jumped out in my tests. Most notably, the chatbot readily acknowledged that sharing the contents of these news articles could violate copyright and skirt subscription paywalls. — Read More

#news-summarization