Le Wang

Xi'an Jiaotong University

Probing Latent Knowledge Conflict for Faithful Retrieval-Augmented Generation

Oct 14, 2025
Via arXiv

SAMPO: Scale-wise Autoregression with Motion PrOmpt for generative world models

Sep 19, 2025
Via arXiv

AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation

Aug 01, 2025
Via arXiv

Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation

Jun 24, 2025
Via arXiv

AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions

Jun 17, 2025
Via arXiv

Time-Unified Diffusion Policy with Action Discrimination for Robotic Manipulation

Jun 11, 2025
Via arXiv

FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation

Jun 10, 2025
Via arXiv

RSRNav: Reasoning Spatial Relationship for Image-Goal Navigation

Apr 25, 2025
Via arXiv

From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval

Apr 25, 2025
Via arXiv

Manipulating Multimodal Agents via Cross-Modal Prompt Injection

Apr 22, 2025
Via arXiv