Tianhao Wu

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Jan 18, 2025
Via arXiv

Computing Approximate Graph Edit Distance via Optimal Transport

Dec 25, 2024
Via arXiv

From Generation to Judgment: Opportunities and Challenges of LLM-as-a-judge

Nov 25, 2024
Via arXiv

DySpec: Faster Speculative Decoding with Dynamic Token Tree Structure

Oct 15, 2024
Via arXiv

Thinking LLMs: General Instruction Following with Thought Generation

Oct 14, 2024
Via arXiv

EmbedLLM: Learning Compact Representations of Large Language Models

Oct 03, 2024
Via arXiv

Canonical Representation and Force-Based Pretraining of 3D Tactile for Dexterous Visuo-Tactile Policy Learning

Sep 26, 2024
Via arXiv

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Jul 28, 2024
Via arXiv

Expressive Gaussian Human Avatars from Monocular RGB Video

Jul 03, 2024
Via arXiv

RouteLLM: Learning to Route LLMs with Preference Data

Jun 26, 2024
Via arXiv