Multi-step denoising + Q guidance · step-by-step

The vertical axis is time t (bottom = pure noise, top = clean action), the horizontal axis is action a. The trajectory climbs upward one step at a time.

cyan diagonal dashed → endpoint â₁ (QGF query point) orange vertical dashed → a_t (OOD query point) top: value landscape Q
2.5