Picture for Yan Wang

Yan Wang

Fudan university

Text-Guided Layer Fusion Mitigates Hallucination in Multimodal LLMs

Add code
Jan 06, 2026
Viaarxiv icon

The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models

Add code
Jan 06, 2026
Viaarxiv icon

OpenOneRec Technical Report

Add code
Dec 31, 2025
Viaarxiv icon

Counterfactual VLA: Self-Reflective Vision-Language-Action Model with Adaptive Reasoning

Add code
Dec 30, 2025
Viaarxiv icon

Beyond Artifacts: Real-Centric Envelope Modeling for Reliable AI-Generated Image Detection

Add code
Dec 24, 2025
Viaarxiv icon

Benchmarking and Enhancing VLM for Compressed Image Understanding

Add code
Dec 24, 2025
Figure 1 for Benchmarking and Enhancing VLM for Compressed Image Understanding
Figure 2 for Benchmarking and Enhancing VLM for Compressed Image Understanding
Figure 3 for Benchmarking and Enhancing VLM for Compressed Image Understanding
Figure 4 for Benchmarking and Enhancing VLM for Compressed Image Understanding
Viaarxiv icon

GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting

Add code
Dec 22, 2025
Figure 1 for GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting
Figure 2 for GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting
Figure 3 for GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting
Figure 4 for GaussianImage++: Boosted Image Representation and Compression with 2D Gaussian Splatting
Viaarxiv icon

AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection

Add code
Dec 15, 2025
Figure 1 for AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection
Figure 2 for AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection
Figure 3 for AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection
Figure 4 for AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection
Viaarxiv icon

Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving

Add code
Dec 12, 2025
Viaarxiv icon

Latent Chain-of-Thought World Modeling for End-to-End Driving

Add code
Dec 11, 2025
Figure 1 for Latent Chain-of-Thought World Modeling for End-to-End Driving
Figure 2 for Latent Chain-of-Thought World Modeling for End-to-End Driving
Figure 3 for Latent Chain-of-Thought World Modeling for End-to-End Driving
Figure 4 for Latent Chain-of-Thought World Modeling for End-to-End Driving
Viaarxiv icon