Yutong Wang

Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency

Nov 25, 2024
Via arXiv

Deploying Ten Thousand Robots: Scalable Imitation Learning for Lifelong Multi-Agent Path Finding

Oct 28, 2024
Via arXiv

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines

Oct 16, 2024
Via arXiv

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

Oct 10, 2024
Via arXiv

Herald: A Natural Language Annotated Lean 4 Dataset

Oct 09, 2024
Via arXiv

MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering

Aug 21, 2024
Via arXiv

GOReloc: Graph-based Object-Level Relocalization for Visual SLAM

Aug 15, 2024
Via arXiv

TasTe: Teaching Large Language Models to Translate through Self-Reflection

Jun 12, 2024
Via arXiv

LNS2+RL: Combining Multi-agent Reinforcement Learning with Large Neighborhood Search in Multi-agent Path Finding

May 28, 2024
Via arXiv

AMCEN: An Attention Masking-based Contrastive Event Network for Two-stage Temporal Knowledge Graph Reasoning

May 16, 2024
Via arXiv