Picture for Yuta Oshima

Yuta Oshima

Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search

Add code
Jan 31, 2025
Viaarxiv icon

ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate

Add code
Nov 05, 2024
Figure 1 for ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate
Figure 2 for ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate
Figure 3 for ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate
Figure 4 for ADOPT: Modified Adam Can Converge with Any $β_2$ with the Optimal Rate
Viaarxiv icon

Enhancing Unimodal Latent Representations in Multimodal VAEs through Iterative Amortized Inference

Add code
Oct 15, 2024
Viaarxiv icon

SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces

Add code
Mar 12, 2024
Figure 1 for SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces
Figure 2 for SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces
Figure 3 for SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces
Figure 4 for SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces
Viaarxiv icon