Rick's Cafe AI 1:40 pm on August 22, 2023
Tags: Image Recognition

Stable Diffusion -XL 1.0-base

SDXL consists of an ensemble of experts pipeline for latent diffusion: In a first step, the base model is used to generate (noisy) latents, which are then further processed with a refinement model (available here: https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0/) specialized for the final denoising steps. Note that the base model can be used as a standalone module.

Alternatively, we can use a two-stage pipeline as follows: First, the base model is used to generate latents of the desired output size. In the second step, we use a specialized high-resolution model and apply a technique called SDEdit (https://arxiv.org/abs/2108.01073, also known as “img2img”) to the latents generated in the first step, using the same prompt. This technique is slightly slower than the first one, as it requires more function evaluations. — Read More

Source code is available at https://github.com/Stability-AI/generative-models .

#image-recognition

Rick's Cafe AI 3:19 pm on July 21, 2023
Tags: Big7 ( 271 ), Image Recognition, Videos ( 383 )

Google’s NEW TAPIR AI Features Have Everyone SHOCKED!

Read More

Read the Paper

#big7, #image-recognition, #videos

Rick's Cafe AI 3:01 pm on July 17, 2023
Tags: Big7 ( 271 ), Image Recognition

Meta claims its new art-generating model is best-in-class

… Today, Meta announced CM3Leon (“chameleon” in clumsy leetspeak), an AI model that the company claims achieves state-of-the-art performance for text-to-image generation. CM3Leon is also distinguished by being one of the first image generators capable of generating captions for images, laying the groundwork for more capable image-understanding models going forward, Meta says.

“With CM3Leon’s capabilities, image generation tools can produce more coherent imagery that better follows the input prompts,” Meta wrote in a blog post shared with TechCrunch earlier this week. “We believe CM3Leon’s strong performance across a variety of tasks is a step toward higher-fidelity image generation and understanding.” — Read More

#image-recognition, #big7

Rick's Cafe AI 1:23 pm on June 5, 2023
Tags: Big7 ( 271 ), Image Recognition

StyleDrop: Text-To-Image Generation in Any Style

We present StyleDrop that enables the generation of images that faithfully follow a specific style, powered by Muse, a text-to-image generative vision transformer. StyleDrop is extremely versatile and captures nuances and details of a user-provided style, such as color schemes, shading, design patterns, and local and global effects. StyleDrop works by efficiently learning a new style by fine-tuning very few trainable parameters (less than 1% of total model parameters), and improving the quality via iterative training with either human or automated feedback. Better yet, StyleDrop is able to deliver impressive results even when the user supplies only a single image specifying the desired style. An extensive study shows that, for the task of style tuning text-to-image models, Styledrop on Muse convincingly outperforms other methods, including DreamBooth and Textual Inversion on Imagen or Stable Diffusion. — Read More

#big7, #image-recognition

Rick's Cafe AI 1:55 pm on May 31, 2023
Tags: Image Recognition

Paragraphica – Context to image (AI) camera

Created by Bjørn Karmann, Paragraphica is a camera that utilizes location data and AI to visualize a “photo” of a specific place and moment. The camera exists both as a physical prototype and an online camera that you can try. — Read More

#image-recognition

Rick's Cafe AI 11:49 am on May 20, 2023
Tags: Image Recognition

This AI-Powered, Point-Based Photo Manipulation System is Wild

Researchers have developed a point-based image manipulation system that uses generative artificial intelligence (AI) technology to allow users to precisely control the pose, shape, expression, and layout of objects.

The research outlines how users can control generative adversarial networks (GANs) with intuitive graphical control. The technology is called DragGAN. — Read More

#image-recognition

Rick's Cafe AI 1:11 pm on May 18, 2023
Tags: Image Recognition

StableStudio is Stability AI’s latest commitment to open-source AI

Stability AI has announced StableStudio, a new open-source variant of its DreamStudio AI text-to-image web app.

Stability AI is releasing an open-source version of DreamStudio, a commercial interface for the company’s AI image generator model, Stable Diffusion. In a press statement on Wednesday, Stability AI said the new release — dubbed StableStudio — “marks a fresh chapter” for the platform and will serve as a showcase for the company’s “dedication to advancing open-source development.” — Read More

#image-recognition

Rick's Cafe AI 9:10 am on May 16, 2023
Tags: Image Recognition

Stability AI releases an open source text-to-animation tool

You’ve heard of text-to-image, but have you heard of text-to-animation?

From anime to childhood classics, animations have brought stories to life by combining still images. Now, with just a text prompt, you can generate your own animations using AI.

On Thursday, Stability AI, the AI company that created Stable Diffusion, unveiled a text-to-animation tool that allows developers and artists to use Stable Diffusion models to generate animations. — Read More

#image-recognition

Rick's Cafe AI 12:09 pm on May 13, 2023
Tags: Image Recognition

Google’s open-source AI tool let me play my favorite Dreamcast game with my face

Project Gameface is ready to install as a Windows app that makes gaming more accessible using only your webcam.

While Wednesday’s Google I/O event largely hyped the company’s biggest AI initiatives, the company also announced updates to the machine learning suite that powers Google Lens and Google Meet features like object tracking and recognition, gesture control, and of course, facial detection. The newest update enables app developers to, among other things, create Snapchat-like face filters and hand tracking, with the company showing off a GIF that’s definitely not a Memoji.

This update underpins a special project announced during the I/O developer keynote: an open-source accessibility application called Project Gameface, which lets you play games… with your face. During the keynote, Google played a very Wes Anderson-esque mini-documentary revealing a tragedy that prompted the company to design Gameface. — Read More

#image-recognition

Rick's Cafe AI 4:22 pm on May 9, 2023
Tags: Image Recognition

MidJourney Has Competition (And It’s Free To Use)!

Read More

#image-recognition

Rick's Cafe AI

The latest in Artificial Intelligence carefully curated into its own special blend

Tag Archives: Image Recognition

Stable Diffusion -XL 1.0-base

Google’s NEW TAPIR AI Features Have Everyone SHOCKED!

Meta claims its new art-generating model is best-in-class

StyleDrop: Text-To-Image Generation in Any Style

Paragraphica – Context to image (AI) camera

This AI-Powered, Point-Based Photo Manipulation System is Wild

StableStudio is Stability AI’s latest commitment to open-source AI

Stability AI releases an open source text-to-animation tool

Google’s open-source AI tool let me play my favorite Dreamcast game with my face

MidJourney Has Competition (And It’s Free To Use)!