Picture for Fuxiao Liu

Fuxiao Liu

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Add code
Aug 27, 2025
Viaarxiv icon

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Viaarxiv icon

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Add code
Apr 10, 2025
Viaarxiv icon

AIDE: Agentically Improve Visual Language Model with Domain Experts

Add code
Feb 13, 2025
Figure 1 for AIDE: Agentically Improve Visual Language Model with Domain Experts
Figure 2 for AIDE: Agentically Improve Visual Language Model with Domain Experts
Figure 3 for AIDE: Agentically Improve Visual Language Model with Domain Experts
Figure 4 for AIDE: Agentically Improve Visual Language Model with Domain Experts
Viaarxiv icon

DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments

Add code
Dec 28, 2024
Figure 1 for DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments
Figure 2 for DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments
Figure 3 for DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments
Figure 4 for DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments
Viaarxiv icon

DeepFM-Crispr: Prediction of CRISPR On-Target Effects via Deep Learning

Add code
Sep 09, 2024
Viaarxiv icon

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Add code
Aug 28, 2024
Figure 1 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 2 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 3 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Figure 4 for Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
Viaarxiv icon

Mosaic IT: Enhancing Instruction Tuning with Data Mosaics

Add code
May 22, 2024
Viaarxiv icon

Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey

Add code
Mar 14, 2024
Figure 1 for Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey
Figure 2 for Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey
Viaarxiv icon

On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities

Add code
Feb 24, 2024
Figure 1 for On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities
Figure 2 for On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities
Figure 3 for On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities
Figure 4 for On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities
Viaarxiv icon