Picture for Jaehong Yoon

Jaehong Yoon

DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation

Add code
Nov 25, 2024
Viaarxiv icon

VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement

Add code
Nov 22, 2024
Figure 1 for VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement
Figure 2 for VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement
Figure 3 for VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement
Figure 4 for VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement
Viaarxiv icon

SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation

Add code
Oct 16, 2024
Figure 1 for SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
Figure 2 for SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
Figure 3 for SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
Figure 4 for SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
Viaarxiv icon

Adapt-$\infty$: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection

Add code
Oct 14, 2024
Viaarxiv icon

Glider: Global and Local Instruction-Driven Expert Router

Add code
Oct 09, 2024
Viaarxiv icon

VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos

Add code
May 29, 2024
Figure 1 for VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Figure 2 for VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Figure 3 for VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Figure 4 for VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
Viaarxiv icon

RACCooN: Remove, Add, and Change Video Content with Auto-Generated Narratives

Add code
May 28, 2024
Viaarxiv icon

EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents

Add code
Mar 18, 2024
Figure 1 for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Figure 2 for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Figure 3 for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Figure 4 for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents
Viaarxiv icon

SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data

Add code
Mar 11, 2024
Figure 1 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 2 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 3 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Figure 4 for SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Viaarxiv icon

BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation

Add code
Feb 15, 2024
Figure 1 for BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation
Figure 2 for BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation
Figure 3 for BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation
Figure 4 for BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation
Viaarxiv icon