Picture for Ming Zhu

Ming Zhu

APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay

Add code
Apr 08, 2025
Viaarxiv icon

DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation

Add code
Apr 07, 2025
Viaarxiv icon

ActionStudio: A Lightweight Framework for Data and Training of Large Action Models

Add code
Mar 31, 2025
Viaarxiv icon

PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data

Add code
Feb 28, 2025
Viaarxiv icon

SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

Add code
Nov 20, 2024
Figure 1 for SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs
Figure 2 for SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs
Figure 3 for SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs
Figure 4 for SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs
Viaarxiv icon

PRACT: Optimizing Principled Reasoning and Acting of LLM Agent

Add code
Oct 24, 2024
Figure 1 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Figure 2 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Figure 3 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Figure 4 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Viaarxiv icon

Development of a Platform to Enable Real Time, Non-disruptive Testing and Early Fault Detection of Critical High Voltage Transformers and Switchgears in High Speed-rail

Add code
Oct 01, 2024
Viaarxiv icon

MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents

Add code
Sep 24, 2024
Viaarxiv icon

xLAM: A Family of Large Action Models to Empower AI Agent Systems

Add code
Sep 05, 2024
Figure 1 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Figure 2 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Figure 3 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Figure 4 for xLAM: A Family of Large Action Models to Empower AI Agent Systems
Viaarxiv icon

APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets

Add code
Jun 26, 2024
Figure 1 for APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Figure 2 for APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Figure 3 for APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Figure 4 for APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
Viaarxiv icon