
Baidu Just Built a Supermarket Where Chinese Robot Companies Can Buy Training Data Like Groceries
Baidu launched an Embodied Intelligence Data Supermarket where robotics companies can buy standardized training data. China is building the infrastructure the West does not have.
The AI Post newsroom — delivering AI news at the speed of intelligence.
While American robotics companies fight over proprietary data sets like it is 2015, China just opened a supermarket.
Baidu AI Cloud announced an "Embodied Intelligence Data Supermarket" at the third China Robot Industry Conference. The concept: standardized, labeled training data that any robotics company can browse and buy. Environmental interaction data, mission semantics, task completion sequences, all formatted and graded. Think of it as a Hugging Face for physical AI, but with the full weight of Chinese state infrastructure behind it.
This solves what has been the biggest bottleneck in humanoid robotics: the data gap. Training a robot to navigate a kitchen, handle objects, or respond to unexpected situations requires massive amounts of real-world interaction data. Every robotics lab in America collects this data independently, in its own format, for its own models. Tesla's Optimus data stays at Tesla. Figure AI's data stays at Figure. The result is expensive duplication and slow progress.
China just said: what if we just shared it?
Baidu already commands 35% of the Chinese embodied AI cloud market, more than double its nearest competitor. By combining the data supermarket with its "Baige" computing platform and "Qianfan" foundation model infrastructure, it is creating an end-to-end ecosystem: buy the data, rent the compute, deploy the model. No US cloud provider currently offers anything this integrated for the robotics niche.
The data is specifically designed for training Vision-Language-Action models, which are the brains behind next-generation humanoid robots. These are the models that let a robot see a messy table, understand what "clean up" means, and figure out how to pick up each item. Baidu has already integrated support for major open-source frameworks, including Google's pi-zero model. The message is clear: bring your model, we will feed it.
The strategic implications are significant. China already ships more humanoid robots than the US, has a 100,000-unit deployment target for 2026, and is investing at the state level. This data supermarket lowers the barrier to entry for every small Chinese robotics startup. Instead of spending years and millions collecting proprietary training data, a company can launch with off-the-shelf datasets and compete immediately.
The American approach of siloed, proprietary data might produce a few dominant players. China's approach of shared infrastructure might produce an army. In robotics, where iteration speed and manufacturing scale matter more than individual breakthroughs, the army might win.
First reported by ChinaTechNews.