Writing

Technical essays and research notes on AI infrastructure, agent systems, and the mechanics behind modern models.

Essays
机器人世界模型的闭环底图:long-horizon 论文都在改哪个零件

A Chinese research map built around one closed loop — a robot choosing actions with a learned imagination model. Fifty recent (2026 H1) world-model and long-horizon papers from top labs, each placed on one of seven parts of the loop: state representation, memory, dynamics, event verification, trust horizon, planner–model coupling, and the action/evaluation/data interface.

Long Horizon Is Not One Problem

An overview of the long-horizon line in one frame: six failure modes crossed against five recurring moves in a single matrix, the robot–agent isomorphism, and the open trust-horizon question. A synthesis of the rollout-drift, interfaces, and map notes below.

Long-horizon world model 的五个接口

A Chinese research note that splits long-horizon world-model work into five system interfaces: planner usage, rollout fidelity, event memory, action hierarchy, and evaluation/data infrastructure.

Long Horizon:机器人世界模型的研究矩阵

A Chinese research matrix for long-horizon robot world models and policies: rollout drift, closed-loop planning, trust horizon, event verification, temporal abstraction, and object/state persistence.

From Rollout to Context

A bilingual research map of the long-horizon problem across robot world models, robot policies, and language-model agents.

Qwen-AgentWorld:文本世界模型的边界

A Chinese research note on language world models, digital-agent simulators, next-observation prediction, and why a text model can still satisfy the world-model interface.

Two Long Horizons

The phrase "long horizon" names a reliability problem in language-model agents and a fidelity problem in world models. A bilingual comparison across METR's time horizon, self-conditioning, MBPO compounding error, Dreamer/TD-MPC planning, and Genie-style drift.

Diffusion:从噪声到数据的一条路径

A bilingual mechanism note on diffusion as iterative denoising, score estimation, latent-space generation, guidance, video diffusion, and the connection to Cosmos Policy.

TD-MPC:世界模型不必还原世界

A Chinese paper note on TD-MPC, task-oriented latent dynamics, reconstruction quality, action quality, short-horizon planning, and terminal value estimation.

Notes
Paper notes

Dense notes on papers worth understanding deeply, not summaries for engagement.

Work in public, but keep the bar high.