Kimi K2.5

Artificial Analysis calls Kimi the new leading open weights model, ‘now closer than ever to the frontier’ behind only OpenAI, Anthropic and Google.

Kimi K2.5 gets to top some benchmarks: HLE-Full with tools (50%), BrowseComp with Agent Swarp (78%), OCRBench (92%), OmiDocBench 1.5 (89%), MathVista (90%) and InfoVQA (93%). It is not too far behind on AIME 2025 (96% vs. 100%), SWE-Bench (77% vs. 81%) and GPQA-Diamond (88% vs. 92%).

[B]enchmarks are highly useful, but easy to overinterpret.

Inference is cheap, and speed is similar to Gemini 3 Pro, modestly faster than Opus. — Read More

#performance

Enterprises Don’t Have an AI Problem. They Have an Architecture Problem

Over the last year, I keep hearing the same statements in meetings, reviews, and architecture forums:

“We’re doing AI.” “We have a chatbot now.” “We’ve deployed an agent.”

When I look a little closer, what most organizations really have is not enterprise AI. They have a tool.

Usually it is a chatbot, or a search assistant, or a workflow automation, or a RAG system. All of these are useful. I have built many of them myself. But none of these, by themselves, represent enterprise AI architecture.

AI is not a feature. AI is not a product.

AI is a new enterprise capability layer. And in large organizations, capability layers must be architected. — Read More

#strategy

Before ChatGPT, this simple machine changed everything

Today’s neural networks feel almost magical.
They write, see, reason, and talk to us like nothing before.

But all of this traces back to one extremely simple machine.

When this machine appeared in the late 1950s, it quietly changed how people thought about intelligence. — Read More

#artificial-intelligence