Felix Juefei-Xu

MLLM-as-a-Judge for Image Safety without Human Labeling

Dec 31, 2024

Concept Guided Co-saliency Objection Detection

Dec 21, 2024

DirectorLLM for Human-Centric Video Generation

Dec 19, 2024

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Dec 13, 2024

LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity

Dec 13, 2024

Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation

Dec 03, 2024

Accelerating Multimodel Large Language Models by Searching Optimal Vision Token Reduction

Nov 30, 2024

Movie Gen: A Cast of Media Foundation Models

Oct 17, 2024

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

Oct 10, 2024

Pixel-Space Post-Training of Latent Diffusion Models

Sep 26, 2024