Picture for Huan Wang

Huan Wang

Stephen

TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action

Add code
Dec 10, 2024
Viaarxiv icon

Slicing Vision Transformer for Flexible Inference

Add code
Dec 06, 2024
Viaarxiv icon

Is Oracle Pruning the True Oracle?

Add code
Nov 28, 2024
Viaarxiv icon

Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models

Add code
Nov 27, 2024
Viaarxiv icon

DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models

Add code
Nov 22, 2024
Viaarxiv icon

SpecTool: A Benchmark for Characterizing Errors in Tool-Use LLMs

Add code
Nov 20, 2024
Viaarxiv icon

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Add code
Nov 06, 2024
Viaarxiv icon

CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments

Add code
Nov 04, 2024
Viaarxiv icon

PRACT: Optimizing Principled Reasoning and Acting of LLM Agent

Add code
Oct 24, 2024
Figure 1 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Figure 2 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Figure 3 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Figure 4 for PRACT: Optimizing Principled Reasoning and Acting of LLM Agent
Viaarxiv icon

LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field

Add code
Sep 26, 2024
Figure 1 for LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field
Figure 2 for LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field
Figure 3 for LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field
Figure 4 for LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field
Viaarxiv icon