Picture for Weiran Yao

Weiran Yao

SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

Add code
Nov 20, 2024
Figure 1 for SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs
Figure 2 for SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs
Figure 3 for SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs
Figure 4 for SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs
Viaarxiv icon

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Add code
Nov 06, 2024
Viaarxiv icon

PRACT: Optimizing Principled Reasoning and Acting of LLM Agent

Add code
Oct 24, 2024
Figure 1 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Figure 2 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Figure 3 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Figure 4 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Viaarxiv icon

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Add code
Sep 05, 2024
Figure 1 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Figure 2 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Figure 3 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Figure 4 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Viaarxiv icon

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents

Add code
Aug 13, 2024
Viaarxiv icon

APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

Add code
Jun 26, 2024
Figure 1 for APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Figure 2 for APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Figure 3 for APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Figure 4 for APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Viaarxiv icon

AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning

Add code
Feb 26, 2024
Figure 1 for AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Figure 2 for AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Figure 3 for AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Figure 4 for AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning
Viaarxiv icon

AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System

Add code
Feb 23, 2024
Figure 1 for AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
Figure 2 for AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
Figure 3 for AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
Figure 4 for AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System
Viaarxiv icon

CaRiNG: Learning Temporal Causal Representation under Non-Invertible Generation Process

Add code
Jan 25, 2024
Viaarxiv icon

Causal Layering via Conditional Entropy

Add code
Jan 19, 2024
Viaarxiv icon