Picture for Benyou Wang

Benyou Wang

Enabling Scalable Oversight via Self-Evolving Critic

Add code
Jan 10, 2025
Viaarxiv icon

RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions

Add code
Dec 31, 2024
Viaarxiv icon

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Add code
Dec 28, 2024
Viaarxiv icon

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Add code
Dec 25, 2024
Figure 1 for HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
Figure 2 for HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
Figure 3 for HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
Figure 4 for HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
Viaarxiv icon

Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion

Add code
Dec 16, 2024
Figure 1 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Figure 2 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Figure 3 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Figure 4 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Viaarxiv icon

BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement

Add code
Dec 16, 2024
Viaarxiv icon

Alignment at Pre-training! Towards Native Alignment for Arabic LLMs

Add code
Dec 04, 2024
Viaarxiv icon

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Add code
Dec 03, 2024
Viaarxiv icon

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

Add code
Nov 06, 2024
Figure 1 for Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
Figure 2 for Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
Figure 3 for Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
Figure 4 for Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
Viaarxiv icon

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Add code
Oct 22, 2024
Figure 1 for UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Figure 2 for UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Figure 3 for UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Figure 4 for UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Viaarxiv icon