Shirley Kokane

SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

Nov 20, 2024

PRACT: Optimizing Principled Reasoning and Acting of LLM Agent

Oct 24, 2024

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Sep 05, 2024

Improving Knowledge Distillation in Transfer Learning with Layer-wise Learning Rates

Jul 05, 2024

APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

Jun 26, 2024

MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

Jun 12, 2024