Picture for Yikai Zhang

Yikai Zhang

Reweighting Improves Conditional Risk Bounds

Add code
Jan 04, 2025
Figure 1 for Reweighting Improves Conditional Risk Bounds
Figure 2 for Reweighting Improves Conditional Risk Bounds
Figure 3 for Reweighting Improves Conditional Risk Bounds
Viaarxiv icon

Revealing the Barriers of Language Agents in Planning

Add code
Oct 16, 2024
Figure 1 for Revealing the Barriers of Language Agents in Planning
Figure 2 for Revealing the Barriers of Language Agents in Planning
Figure 3 for Revealing the Barriers of Language Agents in Planning
Figure 4 for Revealing the Barriers of Language Agents in Planning
Viaarxiv icon

DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?

Add code
Jun 18, 2024
Figure 1 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Figure 2 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Figure 3 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Figure 4 for DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Viaarxiv icon

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Add code
Jun 18, 2024
Figure 1 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 2 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 3 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Figure 4 for OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Viaarxiv icon

Light Up the Shadows: Enhance Long-Tailed Entity Grounding with Concept-Guided Vision-Language Models

Add code
Jun 16, 2024
Viaarxiv icon

SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

Add code
Jun 07, 2024
Figure 1 for SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
Figure 2 for SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
Figure 3 for SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
Figure 4 for SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals
Viaarxiv icon

From Persona to Personalization: A Survey on Role-Playing Language Agents

Add code
Apr 28, 2024
Viaarxiv icon

Dissecting Human and LLM Preferences

Add code
Feb 17, 2024
Viaarxiv icon

TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation

Add code
Feb 08, 2024
Viaarxiv icon

Extending LLMs' Context Window with 100 Samples

Add code
Jan 13, 2024
Viaarxiv icon