First Neuralink patient can control a computer mouse by thinking, claims Elon Musk

The first human being to receive a brain chip from Elon Musk’s Neuralink can apparently control a computer mouse just by thinking, according to Musk.

…”Progress is good, and the patient seems to have made a full recovery, with no ill effects that we are aware of,” Musk said. “Patient is able to move a mouse around the screen by just thinking.”

…Last month, Musk shared in a post on X that Neuralink had successfully performed the transplant surgery on a human for the first time on Jan. 28. — Read More

#human

Gemma: Introducing new state-of-the-art open models

At Google, we believe in making AI helpful for everyone. We have a long history of contributing innovations to the open community, such as with TransformersTensorFlowBERTT5JAXAlphaFold, and AlphaCode. Today, we’re excited to introduce a new generation of open models from Google to assist developers and researchers in building AI responsibly.

Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models. Developed by Google DeepMind and other teams across Google, Gemma is inspired by Gemini, and the name reflects the Latin gemma, meaning “precious stone.” Accompanying our model weights, we’re also releasing tools to support developer innovation, foster collaboration, and guide responsible use of Gemma models. — Read More

#devops

How AI is changing gymnastics judging

There was one individual Olympic spot left. According to the intricate set of rules governing who gets slots for the games, it would come down to who placed highest in the high bar final: Croatia’s Tin Srbić or Brazil’s Arthur Nory Mariano.

They were at the 2023 World Championships in Antwerp, Belgium, last October. Mariano went first. He fell during his routine, giving Srbić some wiggle room. He didn’t need it, though: Srbić completed a clean routine, with Tkachev connections and a double-twisting double layout that he stuck cold; at the end of his routine, he pumped his fists in the air in celebration. He’d qualified for the 2024 Paris Olympics. 

But when his score came in—a 14.500—Srbić thought the judges had made a mistake, one that could cost him a medal at Worlds. He needed to decide if he wanted to make a challenge.  

… These championships were the first time the technology, formally known as the Judging Support System, or JSS, had been used on every apparatus in a gymnastics competition—and its first use in a competition that could make or break an athlete’s Olympic dreams. While the AI judging system did not replace human judges—rather, it was available to help judges review routines in case of an inquiry or a “blocked score”—it still marked a watershed moment for the sport that was years in the making.  — Read More

#augmented-intelligence

Latte: Latent Diffusion Transformer for Video Generation

We propose a novel Latent Diffusion Transformer, namely Latte, for video generation. Latte first extracts spatio-temporal tokens from input videos and then adopts a series of Transformer blocks to model video distribution in the latent space. In order to model a substantial number of tokens extracted from videos, four efficient variants are introduced from the perspective of decomposing the spatial and temporal dimensions of input videos. To improve the quality of generated videos, we determine the best practices of Latte through rigorous experimental analysis, including video clip patch embedding, model variants, timestep-class information injection, temporal positional embedding, and learning strategies. Our comprehensive evaluation demonstrates that Latte achieves state-of-the-art performance across four standard video generation datasets, i.e., FaceForensics, SkyTimelapse, UCF101, and Taichi-HD. In addition, we extend Latte to text-to-video generation (T2V) task, where Latte achieves comparable results compared to recent T2V models. We strongly believe that Latte provides valuable insights for future research on incorporating Transformers into diffusion models for video generation. — Read More

#nlp, #image-recognition