Picture for Guohai Xu

Guohai Xu

REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents

Add code
Feb 15, 2026
Viaarxiv icon

DeepEyesV2: Toward Agentic Multimodal Model

Add code
Nov 10, 2025
Viaarxiv icon

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

Add code
May 20, 2025
Viaarxiv icon

Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis

Add code
May 15, 2025
Viaarxiv icon

MLLM-Selector: Necessity and Diversity-driven High-Value Data Selection for Enhanced Visual Instruction Tuning

Add code
Mar 26, 2025
Viaarxiv icon

Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch

Add code
Feb 24, 2025
Figure 1 for Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
Figure 2 for Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
Figure 3 for Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
Figure 4 for Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
Viaarxiv icon

An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation

Add code
Nov 13, 2023
Figure 1 for An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation
Figure 2 for An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation
Figure 3 for An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation
Figure 4 for An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation
Viaarxiv icon

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model

Add code
Oct 08, 2023
Figure 1 for UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model
Figure 2 for UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model
Figure 3 for UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model
Figure 4 for UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model
Viaarxiv icon

Evaluation and Analysis of Hallucination in Large Vision-Language Models

Add code
Aug 29, 2023
Figure 1 for Evaluation and Analysis of Hallucination in Large Vision-Language Models
Figure 2 for Evaluation and Analysis of Hallucination in Large Vision-Language Models
Figure 3 for Evaluation and Analysis of Hallucination in Large Vision-Language Models
Figure 4 for Evaluation and Analysis of Hallucination in Large Vision-Language Models
Viaarxiv icon

CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility

Add code
Jul 19, 2023
Figure 1 for CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility
Figure 2 for CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility
Figure 3 for CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility
Figure 4 for CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility
Viaarxiv icon