Picture for Mike Zhang

Mike Zhang

ZEST: Zero-shot Embodied Skill Transfer for Athletic Robot Control

Add code
Jan 30, 2026
Viaarxiv icon

CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data

Add code
Jan 25, 2026
Viaarxiv icon

NLPnorth @ TalentCLEF 2025: Comparing Discriminative, Contrastive, and Prompt-Based Methods for Job Title and Skill Matching

Add code
Jun 23, 2025
Viaarxiv icon

Scaling Reasoning can Improve Factuality in Large Language Models

Add code
May 16, 2025
Viaarxiv icon

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

Add code
Apr 09, 2025
Viaarxiv icon

DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers

Add code
Apr 03, 2025
Viaarxiv icon

How Do Hackathons Foster Creativity? Towards AI Collaborative Evaluation of Creativity at Scale

Add code
Mar 06, 2025
Viaarxiv icon

HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings

Add code
Feb 21, 2025
Viaarxiv icon

SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems

Add code
Feb 18, 2025
Figure 1 for SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems
Figure 2 for SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems
Figure 3 for SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems
Figure 4 for SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems
Viaarxiv icon

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Add code
Feb 18, 2025
Figure 1 for Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Figure 2 for Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Figure 3 for Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Figure 4 for Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Viaarxiv icon