Rick's Cafe AI 10:45 am on January 13, 2026
Tags: Performance ( 102 )

Use multiple models

The meta for getting the most out of AI in 2026.

… [I]t doesn’t feel like I could get away with just using one of these models without taking a substantial haircut in capabilities. This is a very strong endorsement for the notion of AI being jagged — i.e. with very strong capabilities spread out unevenly — while also being a bit of an unusual way to need to use a product. Each model is jagged in its own way. Through 2023, 2024, and the earlier days of modern AI, it quite often felt like there was always just one winning model and keeping up was easier. Today, it takes a lot of work and fiddling to make sure you’re not missing out on capabilities. — Read More

#performance

Rick's Cafe AI 5:20 pm on January 12, 2026
Tags: Strategy ( 485 )

When Google Locked the Door, Three MIT Students Picked the Lock

When Google locked AlphaFold 3 behind commercial restrictions, three MIT PhD students rebuilt it in four months. Now Boltz has $28M, a Pfizer partnership, and a bet that open-source can capture drug discovery infrastructure. — Read More

#strategy

Rick's Cafe AI 5:15 pm on January 12, 2026
Tags: Architecture ( 15 )

Agent-native Architectures

Software agents work reliably now. Claude Code demonstrated that a large language model (LLM) with access to bash and file tools, operating in a loop until an objective is achieved, can accomplish complex multi-step tasks autonomously.

The surprising discovery: A really good coding agent is actually a really good general-purpose agent. The same architecture that lets Claude Code refactor a codebase can let an agent organize your files, manage your reading list, or automate your workflows.

The Claude Code software development kit (SDK) makes this accessible. You can build applications where features aren’t code you write—they’re outcomes you describe, achieved by an agent with tools, operating in a loop until the outcome is reached.

This opens up a new field: software that works the way Claude Code works, applied to categories far beyond coding. — Read More

#architecture

Rick's Cafe AI 11:12 am on January 12, 2026
Tags: Reinforcement Learning ( 78 )

The AI Learned to Think on Its Own. Nobody Taught It How.

[In]January 2025, a Chinese startup that most Western engineers had never heard of publishes a research paper that shocks the AI world.

The claim: they trained a reasoning model as capable as OpenAI’s best, for a fraction of the cost. The method? They removed humans from the training loop entirely. No “reward model” (an auxiliary model that learns to predict what humans would prefer). No thousands of annotators paid to rate responses. Just a single signal: the answer is correct, or it isn’t. — Read More

#reinforcement-learning

Rick's Cafe AI 10:49 am on January 12, 2026
Tags: Strategy ( 485 )

AI & Humans: Making the Relationship Work

Leaders of many organizations are urging their teams to adopt agentic AI to improve efficiency, but are finding it hard to achieve any benefit. Managers attempting to add AI agents to existing human teams may find that bots fail to faithfully follow their instructions, return pointless or obvious results or burn precious time and resources spinning on tasks that older, simpler systems could have accomplished just as well.

The technical innovators getting the most out of AI are finding that the technology can be remarkably human in its behavior. And the more groups of AI agents are given tasks that require cooperation and collaboration, the more those human-like dynamics emerge.

Our research suggests that, because of how directly they seem to apply to hybrid teams of human and digital workers, the most effective leaders in the coming years may still be those who excel at understanding the timeworn principles of human management.

We have spent years studying the risks and opportunities for organizations adopting AI. Our 2025 book, Rewiring Democracy, examines lessons from AI adoption in government institutions and civil society worldwide. In it, we identify where the technology has made the biggest impact and where it fails to make a difference. Today, we see many of the organizations we’ve studied taking another shot at AI adoption—this time, with agentic tools. While generative AI generates, agentic AI acts and achieves goals such as automating supply chain processes, making data-driven investment decisions or managing complex project workflows. The cutting edge of AI development research is starting to reveal what works best in this new paradigm. — Read More

#strategy

Rick's Cafe AI 1:08 pm on January 10, 2026
Tags: NLP ( 486 )

AI suddenly develops a human skill on its own Scientists now officially confused, concerned, and considering therapy

People, take a stiff drink for this one, cause it’s going to be long, unhinged, and “why the hell is my toaster negotiating with my fridge” levels of existential blog.

Let me TL;DR this beast for ya.

In a plot twist no one saw coming, but everyone privately feared, our dear AI has decided to pick up a brand-new human skill all by itself, which is the skill of getting along in a group. — Read More

#nlp

Rick's Cafe AI 2:08 pm on January 9, 2026
Tags: China AI ( 136 )

8 plots that explain the state of open models

Starting 2026, most people are aware that a handful of Chinese companies are making strong, open AI models that are applying increasing pressure on the American AI economy.

While many Chinese labs are making models, the adoption metrics are dominated by Qwen (with a little help from DeepSeek). Adoption of the new entrants in the open model scene in 2025, from Z.ai, MiniMax, Kimi Moonshot, and others is actually quite limited. This sets up the position where dethroning Qwen in adoption in 2026 looks impossible overall, but there are areas for opportunity. In fact, the strength of GPT-OSS shows that the U.S. could very well have the smartest open models again in 2026, even if they’re used far less across the ecosystem. — Read More

#china-ai

Rick's Cafe AI 1:46 pm on January 9, 2026
Tags: China vs US ( 139 )

Chinese AI models have lagged the US frontier by 7 months on average since 2023

Since 2023, every model at the frontier of AI capabilities, as measured by the Epoch Capabilities Index, has been developed in the United States. Over that same period, Chinese models have trailed US capabilities by an average of seven months, with a minimum gap of four months and a maximum gap of 14. — Read More

#china-vs-us

Rick's Cafe AI 1:40 pm on January 9, 2026
Tags: Cyber ( 214 )

The ROI Problem in Attack Surface Management

Attack Surface Management (ASM) tools promise reduced risk. What they usually deliver is more information.

Security teams deploy ASM, asset inventories grow, alerts start flowing, and dashboards fill up. There is visible activity and measurable output. But when leadership asks a simple question, “Is this reducing incidents?” the answer is often unclear.

This gap between effort and outcome is the core ROI problem in attack surface management, especially when ROI is measured primarily through asset counts instead of risk reduction. — Read More

#cyber

Rick's Cafe AI 1:08 pm on January 2, 2026
Tags: China AI ( 136 )

mHC: Manifold-Constrained Hyper-Connections

Recently, studies exemplified by Hyper-Connections (HC) have extended the ubiquitous residual connection paradigm established over the past decade by expanding the residual stream width and diversifying connectivity patterns. While yielding substantial performance gains, this diversification fundamentally compromises the identity mapping property intrinsic to the residual connection, which causes severe training instability and restricted scalability, and additionally incurs notable memory access overhead. To address these challenges, we propose Manifold-Constrained Hyper-Connections (mHC), a general framework that projects the residual connection space of HC onto a specific manifold to restore the identity mapping property, while incorporating rigorous infrastructure optimization to ensure efficiency. Empirical experiments demonstrate that mHC is effective for training at scale, offering tangible performance improvements and superior scalability. We anticipate that mHC, as a flexible and practical extension of HC, will contribute to a deeper understanding of topological architecture design and suggest promising directions for the evolution of foundational models. — Read More