Picture for Jialian Wu

Jialian Wu

Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation

Add code
Jun 26, 2025
Viaarxiv icon

TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games

Add code
Jun 11, 2025
Viaarxiv icon

Unleashing Hour-Scale Video Training for Long Video-Language Understanding

Add code
Jun 05, 2025
Viaarxiv icon

MOVi: Training-free Text-conditioned Multi-Object Video Generation

Add code
May 29, 2025
Viaarxiv icon

KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation

Add code
Apr 13, 2025
Viaarxiv icon

Self-Taught Agentic Long Context Understanding

Add code
Feb 21, 2025
Viaarxiv icon

Agent Laboratory: Using LLM Agents as Research Assistants

Add code
Jan 08, 2025
Figure 1 for Agent Laboratory: Using LLM Agents as Research Assistants
Figure 2 for Agent Laboratory: Using LLM Agents as Research Assistants
Figure 3 for Agent Laboratory: Using LLM Agents as Research Assistants
Figure 4 for Agent Laboratory: Using LLM Agents as Research Assistants
Viaarxiv icon

GRiT: A Generative Region-to-text Transformer for Object Understanding

Add code
Dec 01, 2022
Viaarxiv icon

Deformable VisTR: Spatio temporal deformable attention for video instance segmentation

Add code
Mar 12, 2022
Figure 1 for Deformable VisTR: Spatio temporal deformable attention for video instance segmentation
Figure 2 for Deformable VisTR: Spatio temporal deformable attention for video instance segmentation
Figure 3 for Deformable VisTR: Spatio temporal deformable attention for video instance segmentation
Figure 4 for Deformable VisTR: Spatio temporal deformable attention for video instance segmentation
Viaarxiv icon

Efficient Video Instance Segmentation via Tracklet Query and Proposal

Add code
Mar 03, 2022
Figure 1 for Efficient Video Instance Segmentation via Tracklet Query and Proposal
Figure 2 for Efficient Video Instance Segmentation via Tracklet Query and Proposal
Figure 3 for Efficient Video Instance Segmentation via Tracklet Query and Proposal
Figure 4 for Efficient Video Instance Segmentation via Tracklet Query and Proposal
Viaarxiv icon