We propose a novel Masked Temporal Interpolation Diffusion (MTID) model for procedure planning in instructional videos. The paper has been officially accepted at ICLR 2025.
Jan 23, 2025