Wengang Zhou

Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters

Nov 27, 2024

ROOT: VLM based System for Indoor Scene Understanding and Beyond

Nov 24, 2024

BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?

Nov 19, 2024

Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning

Oct 22, 2024

MotionRL: Align Text-to-Motion Generation to Human Preferences with Multi-Reward Reinforcement Learning

Oct 09, 2024

StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting

Oct 06, 2024

P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task

Sep 17, 2024

AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding

Aug 30, 2024

LaneTCA: Enhancing Video Lane Detection with Temporal Context Aggregation

Aug 25, 2024

Scaling up Multimodal Pre-training for Sign Language Understanding

Aug 16, 2024