Picture for Benyou Wang

Benyou Wang

Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion

Add code
Dec 16, 2024
Viaarxiv icon

Alignment at Pre-training! Towards Native Alignment for Arabic LLMs

Add code
Dec 04, 2024
Viaarxiv icon

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Add code
Dec 03, 2024
Viaarxiv icon

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

Add code
Nov 06, 2024
Figure 1 for Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
Figure 2 for Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
Figure 3 for Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
Figure 4 for Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
Viaarxiv icon

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Add code
Oct 22, 2024
Figure 1 for UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Figure 2 for UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Figure 3 for UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Figure 4 for UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
Viaarxiv icon

Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement

Add code
Oct 18, 2024
Viaarxiv icon

Roadmap towards Superhuman Speech Understanding using Large Language Models

Add code
Oct 17, 2024
Figure 1 for Roadmap towards Superhuman Speech Understanding using Large Language Models
Figure 2 for Roadmap towards Superhuman Speech Understanding using Large Language Models
Figure 3 for Roadmap towards Superhuman Speech Understanding using Large Language Models
Figure 4 for Roadmap towards Superhuman Speech Understanding using Large Language Models
Viaarxiv icon

Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts

Add code
Oct 14, 2024
Figure 1 for Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
Figure 2 for Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
Figure 3 for Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
Figure 4 for Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
Viaarxiv icon

VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment

Add code
Oct 12, 2024
Figure 1 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Figure 2 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Figure 3 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Figure 4 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Viaarxiv icon

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Add code
Oct 10, 2024
Figure 1 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Figure 2 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Figure 3 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Figure 4 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Viaarxiv icon