
Zhi Rui Tam

VisTW: Benchmarking Vision-Language Models for Traditional Chinese in Taiwan

Mar 15, 2025

VisTai: Benchmarking Vision-Language Models for Traditional Chinese in Taiwan

Mar 13, 2025

Answer, Refuse, or Guess? Investigating Risk-Aware Decision Making in Language Models

Mar 03, 2025

None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering

Mar 03, 2025

Clear Minds Think Alike: What Makes LLM Fine-tuning Robust? A Study of Token Perplexity

Jan 24, 2025

Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models

Aug 05, 2024

I Need Help! Evaluating LLM's Ability to Ask for Users' Support: A Case Study on Text-to-SQL Generation

Jul 20, 2024

StreamBench: Towards Benchmarking Continuous Improvement of Language Agents

Jun 13, 2024