Without trajID, accurate trajectory control fails due to ambiguous correspondences. |
Without segID, newly emerging regions are generated randomly; With segID, they follow the segmentation but lose appearance cues; with all attributes, generation matches the GT. |
ControlNet-style injection provides limited control, whereas ours achieves accurate motion. |
Random mixing or reversed schedules degrade performance, while our annealing strategy preserves accurate alignment. |