Picture for Yibo Yan

Yibo Yan

CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding

Add code
Jan 29, 2026
Viaarxiv icon

A Visual Semantic Adaptive Watermark grounded by Prefix-Tuning for Large Vision-Language Model

Add code
Jan 12, 2026
Viaarxiv icon

Vision-Language Introspection: Mitigating Overconfident Hallucinations in MLLMs via Interpretable Bi-Causal Steering

Add code
Jan 08, 2026
Viaarxiv icon

Distilling the Thought, Watermarking the Answer: A Principle Semantic Guided Watermark for Large Reasoning Models

Add code
Jan 08, 2026
Viaarxiv icon

EffiReason-Bench: A Unified Benchmark for Evaluating and Advancing Efficient Reasoning in Large Language Models

Add code
Nov 13, 2025
Figure 1 for EffiReason-Bench: A Unified Benchmark for Evaluating and Advancing Efficient Reasoning in Large Language Models
Figure 2 for EffiReason-Bench: A Unified Benchmark for Evaluating and Advancing Efficient Reasoning in Large Language Models
Figure 3 for EffiReason-Bench: A Unified Benchmark for Evaluating and Advancing Efficient Reasoning in Large Language Models
Figure 4 for EffiReason-Bench: A Unified Benchmark for Evaluating and Advancing Efficient Reasoning in Large Language Models
Viaarxiv icon

Sharp Eyes and Memory for VideoLLMs: Information-Aware Visual Token Pruning for Efficient and Reliable VideoLLM Reasoning

Add code
Nov 11, 2025
Viaarxiv icon

GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning

Add code
Aug 06, 2025
Viaarxiv icon

VLA-Mark: A cross modal watermark for large vision-language alignment model

Add code
Jul 18, 2025
Figure 1 for VLA-Mark: A cross modal watermark for large vision-language alignment model
Figure 2 for VLA-Mark: A cross modal watermark for large vision-language alignment model
Figure 3 for VLA-Mark: A cross modal watermark for large vision-language alignment model
Figure 4 for VLA-Mark: A cross modal watermark for large vision-language alignment model
Viaarxiv icon

Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities

Add code
May 27, 2025
Viaarxiv icon

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

Add code
May 22, 2025
Figure 1 for AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models
Figure 2 for AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models
Figure 3 for AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models
Figure 4 for AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models
Viaarxiv icon