Walking past the countless photos of Holocaust survivors and victims at Warsaw’s POLIN Museum of the History of Polish Jews in 2016, New York-native Daniel Patt was haunted by the possibility that he was passing the faces of his own relatives without even knowing it.
For Patt, a 40-year-old software engineer now working for Google, that sort of conundrum presented the potential for a creative solution. And so he set to work creating and developing From Numbers to Names (N2N), an artificial intelligence-driven facial recognition platform that can scan through photos from prewar Europe and the Holocaust, linking them to people living today. Read More
Tag Archives: Image Recognition
Tackling multiple tasks with a single visual language model
One key aspect of intelligence is the ability to quickly learn how to perform a new task when given a brief instruction. For instance, a child may recognise real animals at the zoo after seeing a few pictures of the animals in a book, despite differences between the two. But for a typical visual model to learn a new task, it must be trained on tens of thousands of examples specifically labelled for that task. If the goal is to count and identify animals in an image, as in “three zebras”, one would have to collect thousands of images and annotate each image with their quantity and species. This process is inefficient, expensive, and resource-intensive, requiring large amounts of annotated data and the need to train a new model each time it’s confronted with a new task. As part of DeepMind’s mission to solve intelligence, we’ve explored whether an alternative model could make this process easier and more efficient, given only limited task-specific information.
Today, in the preprint of our paper, we introduce Flamingo, a single visual language model (VLM) that sets a new state of the art in few-shot learning on a wide range of open-ended multimodal tasks. This means Flamingo can tackle a number of difficult problems with just a handful of task-specific examples (in a “few shots”), without any additional training required. Flamingo’s simple interface makes this possible, taking as input a prompt consisting of interleaved images, videos, and text and then output associated language. Read More
Object Detection State of the Art 2022
Object detection has been a hot topic ever since the boom of Deep Learning techniques. This article goes over the most recent state of the art object detectors.
First we will start with an introduction to the topic of object detection itself and it’s key metrics.
The evolution of object detectors began with Viola Jones detector which was used for detection in real-time. Traditionally, object detection algorithms used hand-crafted features to capture relevant information from images and a structured classifier to deal with spatial structures. Read More
AI program DALL-E mini prompts some truly cursed images
If you’ve seen some slightly distorted images on Twitter recently, it’s not just reality continuing to collapse on itself. An open AI program called DALL-E mini has overtaken Twitter in the last week, churning out a surreal stream of warped art.
DALL-E mini is developer Boris Dayma’s take on the the separate DALL-E program, which was released earlier this year. It produces a series of images based on text prompts, like the original program. Read More
The AI that creates any picture you want, explained
Google’s New AI: Flying Through Virtual Worlds!
But is it art, Ma’am? Robot’s platinum jubilee Queen portrait unveiled
Humanoid artist Ai-Da pays tribute to monarch with painting but critic calls it ‘a cynical, transparent con’
At first glance, the Queen could be wearing a tin hat with camouflage netting set against a thunderous sky. A commentary on the inevitable conflicts and turbulence that took place during her 70-year reign, perhaps. Or a thoughtful juxtaposition of stability and instability.
But no, it seems that Ai-Da, the robot artist who painted the Queen’s portrait to mark her platinum jubilee, was simply paying tribute to “an amazing human being”. The monarch’s trademark pearls and bold colours, along with a stoic facial expression, are the standout features of Algorithm Queen, which was unveiled on Friday. Read More
The dark secret behind those cute AI-generated animal images
Another month, another flood of weird and wonderful images generated by an artificial intelligence. In April, OpenAI showed off its new picture-making neural network, DALL-E 2, which could produce remarkable high-res images of almost anything it was asked to. It outstripped the original DALL-E in almost every way.
Now, just a few weeks later, Google Brain has revealed its own image-making AI, called Imagen. And it performs even better than DALL-E 2: it scores higher on a standard measure for rating the quality of computer-generated images, and the pictures it produced were preferred by a group of human judges.
“We’re living through the AI space race!” one Twitter user commented. “The stock image industry is officially toast,” tweeted another. Read More
Mastercard launches tech that lets you pay with your face or hand in stores
Mastercard on Tuesday launched a program that allows retailers to offer biometric payment methods, like facial recognition and fingerprint scanning.
Users can authenticate a payment by showing their face or the palm of their hand instead of swiping their card.
The technology could one day help with the development of payments infrastructure for the “metaverse,” an executive said. Read More
New method detects deepfake videos with up to 99% accuracy
Computer scientists at UC Riverside can detect manipulated facial expressions in deepfake videos with higher accuracy than current state-of-the-art methods. The method also works as well as current methods in cases where the facial identity, but not the expression, has been swapped, leading to a generalized approach to detect any kind of facial manipulation. The achievement brings researchers a step closer to developing automated tools for detecting manipulated videos that contain propaganda or misinformation.
Developments in video editing software have made it easy to exchange the face of one person for another and alter the expressions on original faces. As unscrupulous leaders and individuals deploy manipulated videos to sway political or social opinions, the ability to identify these videos is considered by many essential to protecting free democracies. Methods exist that can detect with reasonable accuracy when faces have been swapped. But identifying faces where only the expressions have been changed is more difficult and to date, no reliable technique exists. Read More