The Chinese companies building language models are set up as the perfect fast-followers for the technology, building on long-standing cultural traditions in education and work, along with subtly different approaches to building technology companies. When you look at the outputs, the latest, biggest models enabling agentic workflows, and the ingredients, excellent scientists, large-scale data, and accelerated computing, the Chinese and American labs look largely similar. The lasting differences emerge in how these are organized and conditioned.
long thought that a reason that the Chinese labs are so good at catching up and keeping up with the frontier is that they’re culturally aligned for this task, but without talking to people directly I felt like it wasn’t my place to attribute substantial influence to this hunch. Speaking with many wonderful, humble, and open scientists at the leading Chinese labs has crystallized a lot of my beliefs. — Read More
Tag Archives: China AI
DeepSeek V4—almost on the frontier, a fraction of the price
Chinese AI lab DeepSeek’s last model release was V3.2 (and V3.2 Speciale) last December. They just dropped the first of their hotly anticipated V4 series in the shape of two preview models, DeepSeek-V4-Pro and DeepSeek-V4-Flash.
Both models are 1 million token context Mixture of Experts. Pro is 1.6T total parameters, 49B active. Flash is 284B total, 13B active. They’re using the standard MIT license.
I think this makes DeepSeek-V4-Pro the new largest open weights model. — Read More
Mysterious ‘Hunter Alpha’ AI Goes Viral. Why Are Top Models Launching In Secret?
What is the Hunter Alpha AI Model? Hunter Alpha, a powerful artificial intelligence model, mysteriously appeared on the AI gateway platform OpenRouter recently. No one knows where it came from. It was described by the platform as a “stealth model”. There’s no official announcement or press release about this AI model, but it drew attention because of its specs of 1 trillion parameters, a 1 million token context window, and free access. — Read More
China leads the humanoid robot race — but the U.S. still has a shot
Since the start of the year, China’s humanoid robots have made waves at home and abroad — from the Consumer Electronics Show in Las Vegas to China’s Lunar New Year Spring Gala — fueling bold claims about a new industrial revolution that would make it impossible for the U.S. to catch up.
Chinese companies now dominate the humanoid robot market, capturing over 90% of global sales with thousands of units shipped last year. While Elon Musk maintains that Tesla will ultimately lead the industry, he recently acknowledged Chinese firms as his primary competition and noted that Tesla’s Optimus robots won’t be ready for launch until at least next year.
To unpack the claims and look beyond the viral robot performances, Lian Jye Su, chief analyst at tech consulting company Omdia and the author of its latest humanoid robotics report, spoke to Rest of World at a virtual event on February 25. — Read More
What Are Chinese People Vibecoding?
“Vibecoding” doesn’t lend itself to easy translation. For now, Chinese speakers call it 氛围编程 fènwéi biānchéng, 氛围 being “atmosphere”/”vibes” and 编程 being coding. This is an awkward expression because 氛围 usually refers to the atmosphere of a space or environment, and doesn’t have the connotation of care-free DIY that “vibe” does in colloquial American English. 氛围编程 sounds nonsensical as a phrase — something like “coding up an atmosphere.”
But we make do, and oftentimes writers simply use the English word. Developers, creatives, and entrepreneurs in China have been creating many interesting coding projects with AI tools over the past year, utilizing not only popular tools by Silicon Valley giants like Cursor and Claude Code, but also domestic models as Chinese AI companies increasingly compete in the coding-agent market.
Tinkering culture has no borders, and companies are cashing in. This is a roundup of reports from Chinese media on how vibecoding is changing the landscape of technology in China. — Read More
The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+
This is the third and final blog in a three-part series on China’s open source community’s historical advancements since January 2025’s “DeepSeek Moment.” The first blog on strategic changes and open artifact growth is available here, and the second blog on architectural and hardware shifts is available here.
In this third article, we examine paths and trajectories of prominent Chinese AI organizations, and posit future directions for open source.
For AI researchers and developers contributing to and relying on the open source ecosystem and for policymakers understanding the rapidly changing environment, due to intraorganizational and global community gains, open source is the dominant and popular approach for Chinese AI organizations for the near future. Openly sharing artifacts from models to papers to deployment infrastructure maps to a strategy with the goal of large-scale deployment and integration. — Read More
Inside China’s Real Advantage: Manufacturing at Scale
Observers often fixate on the most visible layer of China’s tech stack: consumer-facing conveniences like mobile payments, fifteen-minute food delivery, and dockless bikes. These can make for good investments — we regularly cover them at Tech Buzz China — but they are primarily business model innovations, increasingly familiar, and replicable with modest effort. In my opinion, they do not represent China’s true advantages, the ones that resist replication.
What proves far harder to replicate, and far more consequential, is the invisible layer: China’s manufacturing base. This is the part of the ecosystem that actually reshapes global supply chains, yet it remains the part most visitors never see and, in many cases, never think to see. — Read More
China’s Military Uses Hawk and Wolf Behavior to Train AI Weapon Swarms
On January 23, China’s National University of Defense Technology demonstrated something that’s reshaping how autonomous weapons work: a single operator supervising over 200 drones simultaneously during urban combat exercises. The swarm operated with minimal human input, relying on what the People’s Liberation Army calls “effect-based control,” designed to function even when communication signals are jammed.
The technology didn’t emerge from traditional programming. It came from watching hawks hunt.
Engineers at Beihang University, a military-linked institution, observed how hawks select vulnerable prey and trained defensive drones to replicate that behaviour, according to The Wall Street Journal. In parallel tests, attack drones mimicked pigeons to evade threats. The result: in a five-versus-five combat simulation, the hawk-trained drones eliminated all opponents in 5.3 seconds, according to a patent filed in April 2024. — Read More
China’s Z.ai claims it trained a model using only Huawei hardware
Chinese outfit Zhipu AI claims it trained a new model entirely using Huawei hardware, and that it’s the first company to build an advanced model entirely on Chinese hardware.
Zhipu, which styles itself Z.ai and runs a chatbot at that address, offers several models named General Language Model (GLM). On Wednesday the company announced GLM-Image, that it says employs “an independently developed ‘autoregressive + diffusion decoder’ hybrid architecture, which enables the joint generation of image and language models.” represents an important advance on the Nano Banana Pro image-generating AI. — Read More
8 plots that explain the state of open models
Starting 2026, most people are aware that a handful of Chinese companies are making strong, open AI models that are applying increasing pressure on the American AI economy.
While many Chinese labs are making models, the adoption metrics are dominated by Qwen (with a little help from DeepSeek). Adoption of the new entrants in the open model scene in 2025, from Z.ai, MiniMax, Kimi Moonshot, and others is actually quite limited. This sets up the position where dethroning Qwen in adoption in 2026 looks impossible overall, but there are areas for opportunity. In fact, the strength of GPT-OSS shows that the U.S. could very well have the smartest open models again in 2026, even if they’re used far less across the ecosystem. — Read More