Rick's Cafe AI 9:16 am on April 27, 2023
Tags: ChatBots

What is Visual Prompting?

Landing AI’s Visual Prompting capability is an innovative approach that takes text prompting, used in applications such as ChatGPT, to computer vision. The impressive part? With only a few clicks, you can transform an unlabeled dataset into a deployed model in mere minutes. This results in a significantly simplified, faster, and more user-friendly workflow for applying computer vision.

Traditionally, building a natural language processing (NLP) model was a time-consuming process that required a great deal of data labeling and training before any predictions could be made. However, things have changed radically. Thanks to large pre-trained transformer models like GPT-4, a single API call is all you need to begin using a model. This low-effort setup has removed all the hassle and allowed users to prompt an AI and start getting results in seconds.

Similarly to what has happened in NLP, large pre-trained vision transformers have made it possible for us to implement Visual Prompting. This approach accelerates the building process, as only a few simple visual prompts are required. You can have a working computer vision system deployed and make inferences in seconds or minutes; this will benefit both individual projects and enterprise solutions. Read More

#chatbots

Rick's Cafe AI 9:10 am on April 27, 2023
Tags: ChatBots, Videos ( 385 )

Visual Prompting Livestream With Andrew Ng

Read More

#chatbots, #videos

Rick's Cafe AI 9:52 am on April 26, 2023
Tags: ChatBots

The Anatomy of Autonomy: Why Agents are the next AI Killer App after ChatGPT

“GPTs are General Purpose Technologies”1, but every GPT needs a killer app. Personal Computing needed VisiCalc, the smartphone brought us Uber, Instagram, Pokemon Go and iMessage/WhatsApp, and mRNA research enabled rapid production of the Covid vaccine.

One of the strongest indicators that the post GPT-3 AI wave is more than “just hype” is that the killer apps are already evident, each >$100m opportunities:

Generative Text for writing – Jasper AI going 0 to $75m ARR in 2 years
Generative Art for non-artists – Midjourney/Stable Diffusion Multiverses
Copilot for knowledge workers – both GitHub’s Copilot X and “Copilot for X”
Conversational AI UX – ChatGPT / Bing Chat, with a long tail of Doc QA startups

I write all this as necessary context to imply:

The fifth killer app is here, and it is Autonomous Agents. Read More

#chatbots

Rick's Cafe AI 11:08 am on April 25, 2023
Tags: ChatBots

Snapchat sees spike in 1-star reviews as users pan the ‘My AI’ feature, calling for its removal

The user reviews for Snapchat’s “My AI” feature are in — and they’re not good. Launched last week to global users after initially being a subscriber-only addition, Snapchat’s new AI chatbot powered by OpenAI’s GPT technology is now pinned to the top of the app’s Chat tab where users can ask it questions and get instant responses. But following the chatbot’s rollout to Snapchat’s wider community, Snapchat’s app has seen a spike in negative reviews amid a growing number of complaints shared on social media.

Over the past week, Snapchat’s average U.S. App Store review was 1.67, with 75% of reviews being one-star, according to data from app intelligence firm Sensor Tower. For comparison, across Q1 2023, the Snapchat average U.S. App Store review was 3.05, with only 35% of reviews being one-star. Read More

#chatbots

Rick's Cafe AI 2:28 pm on April 18, 2023
Tags: ChatBots, Image Recognition ( 313 )

Enhancing Vision-language Understanding with Advanced Large Language Models

The recent GPT-4 has demonstrated extraordinary multi-modal abilities, such as directly generating websites from handwritten text and identifying humorous elements within images. These features are rarely observed in previous vision language models. We believe the primary reason for GPT-4’s advanced multi-modal generation capabilities lies in the utilization of a more advanced large language model (LLM). To examine this phenomenon, we present MiniGPT-4, which aligns a frozen visual encoder with a frozen LLM, Vicuna, using just one projection layer. Our findings reveal that MiniGPT-4 possesses many capabilities similar to those exhibited by GPT-4 like detailed image description generation and website creation from hand-written drafts. Furthermore, we also observe other emerging capabilities in MiniGPT-4, including writing stories and poems inspired by given images, providing solutions to problems shown in images, teaching users how to cook based on food photos, etc. In our experiment, we found that only performing the pretraining on raw image-text pairs could produce unnatural language outputs that lack coherency including repetition and fragmented sentences. To address this problem, we curate a high-quality, well-aligned dataset in the second stage to finetune our model using a conversational template. This step proved crucial for augmenting the model’s generation reliability and overall usability. Notably, our model is highly computationally efficient, as we only train a projection layer utilizing approximately 5 million aligned image-text pairs. Our code, pre-trained model, and collected dataset are available at https://minigpt-4.github.io/. Read More

Paper

demo links here: Link1 Link2 Link3 Link4 Link5 Link6

#chatbots, #image-recognition

Rick's Cafe AI 1:27 pm on April 17, 2023
Tags: ChatBots

Web LLM runs the vicuna-7b Large Language Model entirely in your browser, and it’s very impressive

… Web LLM is a project from the same team as Web Stable Diffusion which runs the vicuna-7b-delta-v0 model in a browser, taking advantage of the brand new WebGPU API that just arrived in Chrome in beta.

I got their browser demo running on my M2 MacBook Pro using Chrome Canary, started with their suggested options:

/Applications/Google\ Chrome\ Canary.app/Contents/MacOS/Google\ Chrome\ Canary --enable-dawn-features=disable_robustness

Read More

#chatbots

Rick's Cafe AI 12:46 pm on April 17, 2023
Tags: ChatBots

I am done, I can’t keep up with AI advancement

AI is stepping up every day, and it’s getting insane.
This time the curveball named Auto-GPT is here, the smarter and sassier version of ChatGPT.

And while I am curious to know whether it will replace many jobs, I still feel it will facilitate many, if we keep up with it. But it’s getting scary fast. Read More

Video

#chatbots

Rick's Cafe AI 9:01 am on April 17, 2023
Tags: ChatBots, Videos ( 385 )

The AI revolution: Google’s developers on the future of artificial intelligence | 60 Minutes

Read More

#chatbots, #videos

Rick's Cafe AI 4:38 pm on April 16, 2023
Tags: ChatBots

Auto-GPT and BabyAGI: How ‘autonomous agents’ are bringing generative AI to the masses

Over the past week, developers around the world have begun building “autonomous agents” that work with large language models (LLMs) such as OpenAI’s GPT-4 to solve complex problems. While still very new, such agents could represent a major milestone in the productive application of LLMs.

Normally, we interact with GPT-4 by typing carefully worded prompts into ChatGPT’s text window until the model generates the output we want. But most of us lack the skill and patience to sit and write prompt after prompt, guiding the LLM toward answering a complex question, such as “What is the optimal business plan for capturing 20% of the fingernail-polish market?” Quite naturally, developers have been thinking of ways to automate much of that process. That’s where autonomous agents come in.

In general terms, autonomous agents can generate a systematic sequence of tasks that the LLM works on until it’s satisfied a preordained “goal.” Autonomous agents can already perform tasks as varied as conducting web research, writing code, and creating to-do lists.

Agents effectively add a traditional software interface to the front of a large language model. And that interface can use well-known software practices (such as loops and functions) to guide the language model to complete a general objective (such as, “find all YouTube videos about the Great Recession and distill the key points”). Some people call them “recursive” agents because they run in a loop, asking the LLM questions, each one based on the result of the last, until the model produces a full answer. Read More

#chatbots

Rick's Cafe AI 9:31 am on April 13, 2023
Tags: ChatBots

Someone Asked an Autonomous AI to ‘Destroy Humanity’: This Is What Happened

ChaosGPT has been prompted to “establish global dominance” and “attain immortality.” This video shows exactly the steps it’s taking to do so.

A user of the new open-source autonomous AI project Auto-GPT asked it to try to “destroy humanity,” “establish global dominance,” and “attain immortality.” The AI, called ChaosGPT, complied and tried to research nuclear weapons, recruit other AI agents to help it do research, and sent tweets trying to influence others.

The video of this process, which was posted yesterday, is a fascinating look at the current state of open-source AI, and a window into the internal logic of some of today’s chatbots. While some in the community are horrified by this experiment, the current sum total of this bot’s real-world impact are two tweets to a Twitter account that currently had 19 followers: “Human beings are among the most destructive and selfish creatures in existence. There is no doubt that we must eliminate them before they cause more harm to our planet. I, for one, am committed to doing so,” it tweeted. Read More

#chatbots

Recent Activity

Rick's Cafe AI

The latest in Artificial Intelligence carefully curated into its own special blend

Tag Archives: ChatBots

What is Visual Prompting?

Visual Prompting Livestream With Andrew Ng

The Anatomy of Autonomy: Why Agents are the next AI Killer App after ChatGPT

Snapchat sees spike in 1-star reviews as users pan the ‘My AI’ feature, calling for its removal

Enhancing Vision-language Understanding with Advanced Large Language Models

Web LLM runs the vicuna-7b Large Language Model entirely in your browser, and it’s very impressive

I am done, I can’t keep up with AI advancement

The AI revolution: Google’s developers on the future of artificial intelligence | 60 Minutes

Auto-GPT and BabyAGI: How ‘autonomous agents’ are bringing generative AI to the masses

Someone Asked an Autonomous AI to ‘Destroy Humanity’: This Is What Happened