Shelby Heinecke

TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action

Dec 10, 2024
Via arXiv

SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

Nov 20, 2024
Via arXiv

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Nov 06, 2024
Via arXiv

PRACT: Optimizing Principled Reasoning and Acting of LLM Agent

Oct 24, 2024
Via arXiv

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Sep 05, 2024
Via arXiv

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Aug 16, 2024
Via arXiv

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Aug 13, 2024
Via arXiv

Personalized Multi-task Training for Recommender System

Jul 31, 2024
Via arXiv

APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

Jun 26, 2024
Via arXiv

MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

Jun 12, 2024
Via arXiv