Picture for Mike Zhang

Mike Zhang

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

Add code
Apr 09, 2025
Viaarxiv icon

DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers

Add code
Apr 03, 2025
Viaarxiv icon

How Do Hackathons Foster Creativity? Towards AI Collaborative Evaluation of Creativity at Scale

Add code
Mar 06, 2025
Viaarxiv icon

HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings

Add code
Feb 21, 2025
Viaarxiv icon

SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems

Add code
Feb 18, 2025
Viaarxiv icon

On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation

Add code
Feb 18, 2025
Viaarxiv icon

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Add code
Feb 18, 2025
Viaarxiv icon

SnakModel: Lessons Learned from Training an Open Danish Large Language Model

Add code
Dec 17, 2024
Figure 1 for SnakModel: Lessons Learned from Training an Open Danish Large Language Model
Figure 2 for SnakModel: Lessons Learned from Training an Open Danish Large Language Model
Figure 3 for SnakModel: Lessons Learned from Training an Open Danish Large Language Model
Figure 4 for SnakModel: Lessons Learned from Training an Open Danish Large Language Model
Viaarxiv icon

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Add code
Nov 29, 2024
Figure 1 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 2 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 3 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Figure 4 for INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge
Viaarxiv icon

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Add code
Nov 25, 2024
Figure 1 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 2 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 3 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Figure 4 for All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
Viaarxiv icon