Picture for Shoufa Chen

Shoufa Chen

ControlAR: Controllable Image Generation with Autoregressive Models

Add code
Oct 03, 2024
Figure 1 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 2 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 3 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 4 for ControlAR: Controllable Image Generation with Autoregressive Models
Viaarxiv icon

MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents

Add code
Jun 12, 2024
Viaarxiv icon

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Add code
Jun 10, 2024
Viaarxiv icon

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis

Add code
Feb 25, 2024
Viaarxiv icon

GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation

Add code
Dec 07, 2023
Figure 1 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 2 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 3 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 4 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Viaarxiv icon

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing

Add code
Oct 09, 2023
Figure 1 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Figure 2 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Figure 3 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Figure 4 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Viaarxiv icon

Enhancing Your Trained DETRs with Box Refinement

Add code
Jul 21, 2023
Viaarxiv icon

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Add code
Jul 07, 2023
Viaarxiv icon

Going Denser with Open-Vocabulary Part Segmentation

Add code
May 18, 2023
Viaarxiv icon

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

Add code
May 11, 2023
Viaarxiv icon