Qi Sun

MaDiS: Taming Masked Diffusion Language Models for Sign Language Generation

Jan 27, 2026
Via arXiv

Orchestrating Specialized Agents for Trustworthy Enterprise RAG

Jan 26, 2026
Via arXiv

Animus3D: Text-driven 3D Animation via Motion Score Distillation

Dec 14, 2025
Via arXiv

Unitho: A Unified Multi-Task Framework for Computational Lithography

Nov 14, 2025
Via arXiv

OregairuChar: A Benchmark Dataset for Character Appearance Frequency Analysis in My Teen Romantic Comedy SNAFU

Nov 07, 2025
Via arXiv

GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts

Sep 10, 2025
Via arXiv

Cost-Aware Routing for Efficient Text-To-Image Generation

Jun 17, 2025
Via arXiv

From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems

May 21, 2025
Via arXiv

Advancing Sequential Numerical Prediction in Autoregressive Models

May 19, 2025
Via arXiv

NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks

Apr 28, 2025
Via arXiv