A Polish radio station is at the center of controversy after it replaced its hosts with Artificial Intelligence (AI) presenters.
OFF Radio Krakow, based in southern Poland, introduced three avatars in what it said was “the first experiment in Poland in which journalists…are virtual characters created by AI.”
The avatars were created in hopes of reaching a younger audience by using them to touch on cultural, art and social topics such as LGBTQ+ issues. — Read More
Daily Archives: October 25, 2024
HeyGen enables your digital twin to do Zoom calls for you
Video platform HeyGen has added a feature that it claims allows users to send AI-powered digital versions of themselves to Zoom meetings and other live interactions.
The avatars can join one or more meetings simultaneously, 24/7. They are designed to not only look and sound like the people they are representing, buy they will also think, talk and make decisions like them, according to HeyGen.
The HeyGen Interactive Avatar is equipped with OpenAI real-time voice integration, which allows it to hold an intelligent, efficient and timely conversation with any audience. — Read More
Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models
Consistency models (CMs) are a powerful class of diffusion-based generative models optimized for fast sampling. Most existing CMs are trained using discretized timesteps, which introduce additional hyperparameters and are prone to discretization errors. While continuous-time formulations can mitigate these issues, their success has been limited by training instability. To address this, we propose a simplified theoretical framework that unifies previous parameterizations of diffusion models and CMs, identifying the root causes of instability. Based on this analysis, we introduce key improvements in diffusion process parameterization, network architecture, and training objectives. These changes enable us to train continuous-time CMs at an unprecedented scale, reaching 1.5B parameters on ImageNet 512×512. Our proposed training algorithm, using only two sampling steps, achieves FID scores of 2.06 on CIFAR-10, 1.48 on ImageNet 64×64, and 1.88 on ImageNet 512×512, narrowing the gap in FID scores with the best existing diffusion models to within 10%. — Read More