Picture for Yutong Wang

Yutong Wang

MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation

Add code
Feb 18, 2025
Viaarxiv icon

Hierarchical Trajectory (Re)Planning for a Large Scale Swarm

Add code
Jan 28, 2025
Viaarxiv icon

Make-A-Character 2: Animatable 3D Character Generation From a Single Image

Add code
Jan 15, 2025
Figure 1 for Make-A-Character 2: Animatable 3D Character Generation From a Single Image
Figure 2 for Make-A-Character 2: Animatable 3D Character Generation From a Single Image
Figure 3 for Make-A-Character 2: Animatable 3D Character Generation From a Single Image
Figure 4 for Make-A-Character 2: Animatable 3D Character Generation From a Single Image
Viaarxiv icon

UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility

Add code
Jan 04, 2025
Viaarxiv icon

Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency

Add code
Nov 25, 2024
Figure 1 for Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency
Figure 2 for Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency
Figure 3 for Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency
Figure 4 for Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency
Viaarxiv icon

Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding

Add code
Oct 28, 2024
Figure 1 for Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding
Figure 2 for Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding
Figure 3 for Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding
Figure 4 for Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding
Viaarxiv icon

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

Add code
Oct 16, 2024
Figure 1 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Figure 2 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Figure 3 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Figure 4 for WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
Viaarxiv icon

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

Add code
Oct 10, 2024
Figure 1 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Figure 2 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Figure 3 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Figure 4 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Viaarxiv icon

Herald: A Natural Language Annotated Lean 4 Dataset

Add code
Oct 09, 2024
Viaarxiv icon

MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering

Add code
Aug 21, 2024
Viaarxiv icon