Teaching LLMs to reason like Bayesians

AI systems based on large language models (LLMs) are increasingly used as agents that interact with users and the world. To do this successfully, LLMs need to construct internal representations of the world and estimate the probability that each of these representations is accurate. Take personalized recommendations, for example: the LLM needs to gradually infer the user’s preferences from their choices over the course of multiple interactions.

Bayesian inference defines the optimal way to perform such updates. By implementing this strategy, LLMs could optimize user interactions by updating their estimates of the user’s preferences as new info about the user arrives. But without specific training, LLMs often default to simple heuristics — like assuming everyone wants the cheapest option — instead of inferring a specific user’s unique preferences.

In “Bayesian teaching enables probabilistic reasoning in large language models”, we teach the LLMs to reason in a Bayesian manner by training them to mimic the predictions of the Bayesian model, which defines the optimal way to reason about probabilities. We find that this approach not only significantly improves the LLM’s performance on the particular recommendation task on which it is trained, but also enables generalization to other tasks. This suggests that this method teaches the LLM to better approximate Bayesian reasoning. More generally, our results indicate that LLMs can effectively learn reasoning skills from examples and generalize those skills to new domains. — Read More

#training

China leads the humanoid robot race — but the U.S. still has a shot

Since the start of the year, China’s humanoid robots have made waves at home and abroad — from the Consumer Electronics Show in Las Vegas to China’s Lunar New Year Spring Gala — fueling bold claims about a new industrial revolution that would make it impossible for the U.S. to catch up.

Chinese companies now dominate the humanoid robot market, capturing over 90% of global sales with thousands of units shipped last year. While Elon Musk maintains that Tesla will ultimately lead the industry, he recently acknowledged Chinese firms as his primary competition and noted that Tesla’s Optimus robots won’t be ready for launch until at least next year.

To unpack the claims and look beyond the viral robot performances, Lian Jye Su, chief analyst at tech consulting company Omdia and the author of its latest humanoid robotics report, spoke to Rest of World at a virtual event on February 25. — Read More

#china-ai, #robotics

The Top 100 Gen AI Consumer Apps — 6th Edition

Three years ago, we published the first edition of this list with a simple goal: identify which generative AI products were actually getting used by mainstream consumers. At the time, the distinction between “AI-first” companies and everything else was clear. ChatGPT, Midjourney, and Character.AI were purpose-built around foundation models. The rest of the software world was still figuring out what to do with the technology.

That distinction no longer holds. …From this edition onward, we’re broadening the aperture to include any consumer product where generative AI has become a core part of the experience — including CapCut, Canva, Notion, Picsart, Freepik, and Grammarly. The result is what we believe is a more accurate picture of how people actually use AI, though the bulk of the top products continue to be AI-native. — Read More

#strategy