Rick's Cafe AI 7:34 am on July 12, 2024
Tags: Audio

Generating audio for video

Video-to-audio research uses video pixels and text prompts to generate rich soundtracks

Video generation models are advancing at an incredible pace, but many current systems can only generate silent output. One of the next major steps toward bringing generated movies to life is creating soundtracks for these silent videos.

Today, we’re sharing progress on our video-to-audio (V2A) technology, which makes synchronized audiovisual generation possible. V2A combines video pixels with natural language text prompts to generate rich soundscapes for the on-screen action. — Read More

Read the Paper

#audio

Rick's Cafe AI 12:19 pm on May 21, 2024
Tags: Audio, Ethics ( 85 )

OpenAI pauses use of “Sky” voice after threat of legal action.

OpenAI has paused a voice mode option for ChatGPT-4o, Sky, after backlash accusing the AI company of intentionally ripping off Scarlett Johansson’s critically acclaimed voice-acting performance in the 2013 sci-fi film Her.

In a blog defending its casting decision for Sky, OpenAI went into great detail explaining its process for choosing the individual voice options for its chatbot. But ultimately, the company seemed pressed to admit that Sky’s voice was just too similar to Johansson’s to keep using it, at least for now. — Read More

#audio, #ethics

Rick's Cafe AI 1:25 pm on May 4, 2024
Tags: Audio

Randy Travis’s New Song Recreates His Voice With AI Technology

Randy Travis, who lost much of his speech in a 2013 stroke, used artificial intelligence technology to clone his voice for his first recording in more than a decade.

Travis, his longtime producer Kyle Lehning, Travis’s wife Mary, and Warner Music Nashville co-chair and co-president Cris Lacy spoke with CBS Sunday Morning to detail how AI helped create “Where That Came From,” Travis’s new song, which was released on Friday. The full report will air Sunday. — Read More

#audio

Rick's Cafe AI 1:21 pm on May 4, 2024
Tags: Audio, Videos ( 375 )

Washed Out “The Hardest Part” – Made with OpenAI’s Sora

Read More

#audio, #videos

Rick's Cafe AI 8:06 am on April 25, 2024
Tags: Audio

How to make music with AI using Udio

There’s something quite alluring about trying to create art in a form you’re less familiar with. AI music is the latest canvas in this space.

While we can easily sketch a drawing with a pen and piece of paper at home, not all of us have instruments lying around or the skills to use them.

Generative AI gets rid of those hurdles, and tools like Udio, Stable Audio, Cassette AI and Suno allow us to dip our toes into music production. Prior experience is not required. Furthermore, Udio seems to be on to something: it combines a simple user experience with pretty decent results. — Read More

#audio

Rick's Cafe AI 5:23 pm on April 22, 2024
Tags: Audio

Drake Uses AI Tupac and Snoop Dogg Vocals on ‘Taylor Made Freestyle’

The beef between Drake and what continues to be a strong sect of the hip-hop community grows deeper. On Friday night (April 19), the rapper released a song on his social media entitled “Taylor Made Freestyle,” which uses AI vocals from Tupac Shakur and Snoop Dogg, as a stopgap between diss records while he awaits Kendrick Lamar’s reply to his freshly released “Push Ups.” — Read More

#audio

Rick's Cafe AI 7:57 am on April 3, 2024
Tags: Audio, VFX ( 186 )

200+ Artists Urge Tech Platforms: Stop Devaluing Music

STOP DEVALUING MUSIC. An open letter signed by over 200 musicians calls on AI developers, tech companies, platforms and digital music services to stop using AI to “infringe upon and devalue the rights of human artists.” — Read More

#audio, #vfx

Rick's Cafe AI 7:56 am on April 1, 2024
Tags: Audio

OpenAI built a voice cloning tool, but you can’t use it… yet

As deepfakes proliferate, OpenAI is refining the tech used to clone voices — but the company insists it’s doing so responsibly.

Today marks the preview debut of OpenAI’s Voice Engine, an expansion of the company’s existing text-to-speech API. Under development for about two years, Voice Engine allows users to upload any 15-second voice sample to generate a synthetic copy of that voice. But there’s no date for public availability yet, giving the company time to respond to how the model is used and abused.

“We want to make sure that everyone feels good about how it’s being deployed — that we understand the landscape of where this tech is dangerous and we have mitigations in place for that,” Jeff Harris, a member of the product staff at OpenAI, told TechCrunch in an interview. — Read More

#audio

Rick's Cafe AI 2:30 pm on March 8, 2024
Tags: Audio, Videos ( 375 )

Mikey Shulman: Suno and the Sound of AI Music

Read More

#audio, #videos

Rick's Cafe AI 10:16 am on February 20, 2024
Tags: Audio, VFX ( 186 )

If you thought Sora was impressive, now watch it with AI-generated sound from ElevenLabs

Artificial intelligence speech startup ElevenLabs offered an insight into what it’s planning to release in the future, adding sound effects to AI-generated video for the first time.

Best known for its near human-like text-to-speech and synthetic voice services, ElevenLabs added artificially generated sound effects to videos produced using OpenAI’s Sora.

OpenAI unveiled its impressive Sora text-to-video artificial intelligence model last week, showcasing some of the most realistic, consistent and longest AI-generated video to date. — Read More

#audio, #vfx

Rick's Cafe AI

The latest in Artificial Intelligence carefully curated into its own special blend

Tag Archives: Audio
