(1). Motion alignment is limited by tracking quality (top row: glove).

(2). generation is constrained by the base video model (bottom row: fails on a 360° camera orbit).