Picture for Yuandong Tian

Yuandong Tian

AdvPrefix: An Objective for Nuanced LLM Jailbreaks

Add code
Dec 13, 2024
Viaarxiv icon

Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking

Add code
Dec 12, 2024
Viaarxiv icon

Training Large Language Models to Reason in a Continuous Latent Space

Add code
Dec 09, 2024
Viaarxiv icon

Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning

Add code
Nov 21, 2024
Viaarxiv icon

On the Surprising Effectiveness of Attention Transfer for Vision Transformers

Add code
Nov 14, 2024
Viaarxiv icon

To the Globe (TTG): Towards Language-Driven Guaranteed Travel Planning

Add code
Oct 21, 2024
Viaarxiv icon

MagicPIG: LSH Sampling for Efficient LLM Generation

Add code
Oct 21, 2024
Figure 1 for MagicPIG: LSH Sampling for Efficient LLM Generation
Figure 2 for MagicPIG: LSH Sampling for Efficient LLM Generation
Figure 3 for MagicPIG: LSH Sampling for Efficient LLM Generation
Figure 4 for MagicPIG: LSH Sampling for Efficient LLM Generation
Viaarxiv icon

Agent-as-a-Judge: Evaluate Agents with Agents

Add code
Oct 14, 2024
Viaarxiv icon

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Add code
Oct 13, 2024
Viaarxiv icon

Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets

Add code
Oct 02, 2024
Viaarxiv icon