Yew Ken Chia

PromptDistill: Query-based Selective Token Retention in Intermediate Layers for Efficient Large Language Model Inference

Mar 30, 2025
Via arXiv

The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles

Feb 03, 2025
Via arXiv

M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework

Nov 09, 2024
Via arXiv

Reasoning Paths Optimization: Learning to Reason and Explore From Diverse Paths

Oct 07, 2024
Via arXiv

Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models

Sep 22, 2024
Via arXiv

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Jul 29, 2024
Via arXiv

Auto Arena of LLMs: Automating LLM Evaluations with Agent Peer-battles and Committee Discussions

May 30, 2024
Via arXiv

PuzzleVQA: Diagnosing Multimodal Reasoning Challenges of Language Models with Abstract Visual Patterns

Mar 20, 2024
Via arXiv

Contrastive Chain-of-Thought Prompting

Nov 15, 2023
Via arXiv

Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning

Jul 05, 2023
Via arXiv