Picture for Mengchen Liu

Mengchen Liu

Stephen

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models

Add code
Jan 26, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

InfoChartQA: A Benchmark for Multimodal Question Answering on Infographic Charts

Add code
May 25, 2025
Viaarxiv icon

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Add code
Mar 03, 2025
Figure 1 for Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs
Figure 2 for Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs
Figure 3 for Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs
Figure 4 for Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs
Viaarxiv icon

Benchmarking Large and Small MLLMs

Add code
Jan 04, 2025
Figure 1 for Benchmarking Large and Small MLLMs
Figure 2 for Benchmarking Large and Small MLLMs
Figure 3 for Benchmarking Large and Small MLLMs
Figure 4 for Benchmarking Large and Small MLLMs
Viaarxiv icon

ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities

Add code
Oct 08, 2024
Figure 1 for ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities
Figure 2 for ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities
Figure 3 for ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities
Figure 4 for ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities
Viaarxiv icon

SynChart: Synthesizing Charts from Language Models

Add code
Sep 25, 2024
Viaarxiv icon

On Pre-training of Multimodal Language Models Customized for Chart Understanding

Add code
Jul 19, 2024
Figure 1 for On Pre-training of Multimodal Language Models Customized for Chart Understanding
Figure 2 for On Pre-training of Multimodal Language Models Customized for Chart Understanding
Figure 3 for On Pre-training of Multimodal Language Models Customized for Chart Understanding
Figure 4 for On Pre-training of Multimodal Language Models Customized for Chart Understanding
Viaarxiv icon

Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search

Add code
Mar 15, 2024
Figure 1 for Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Figure 2 for Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Figure 3 for Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Figure 4 for Real-Time Image Segmentation via Hybrid Convolutional-Transformer Architecture Search
Viaarxiv icon

An Evaluation of GPT-4V and Gemini in Online VQA

Add code
Dec 17, 2023
Figure 1 for An Evaluation of GPT-4V and Gemini in Online VQA
Figure 2 for An Evaluation of GPT-4V and Gemini in Online VQA
Figure 3 for An Evaluation of GPT-4V and Gemini in Online VQA
Figure 4 for An Evaluation of GPT-4V and Gemini in Online VQA
Viaarxiv icon