Picture for Yuandong Tian

Yuandong Tian

Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition

Add code
Jan 04, 2025
Figure 1 for Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition
Figure 2 for Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition
Figure 3 for Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition
Figure 4 for Tensor-GaLore: Memory-Efficient Training via Gradient Tensor Decomposition
Viaarxiv icon

AdvPrefix: An Objective for Nuanced LLM Jailbreaks

Add code
Dec 13, 2024
Viaarxiv icon

Sail into the Headwind: Alignment via Robust Rewards and Dynamic Labels against Reward Hacking

Add code
Dec 12, 2024
Viaarxiv icon

Training Large Language Models to Reason in a Continuous Latent Space

Add code
Dec 09, 2024
Figure 1 for Training Large Language Models to Reason in a Continuous Latent Space
Figure 2 for Training Large Language Models to Reason in a Continuous Latent Space
Figure 3 for Training Large Language Models to Reason in a Continuous Latent Space
Figure 4 for Training Large Language Models to Reason in a Continuous Latent Space
Viaarxiv icon

Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning

Add code
Nov 21, 2024
Figure 1 for Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning
Figure 2 for Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning
Figure 3 for Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning
Figure 4 for Towards Full Delegation: Designing Ideal Agentic Behaviors for Travel Planning
Viaarxiv icon

On the Surprising Effectiveness of Attention Transfer for Vision Transformers

Add code
Nov 14, 2024
Viaarxiv icon

To the Globe (TTG): Towards Language-Driven Guaranteed Travel Planning

Add code
Oct 21, 2024
Viaarxiv icon

MagicPIG: LSH Sampling for Efficient LLM Generation

Add code
Oct 21, 2024
Figure 1 for MagicPIG: LSH Sampling for Efficient LLM Generation
Figure 2 for MagicPIG: LSH Sampling for Efficient LLM Generation
Figure 3 for MagicPIG: LSH Sampling for Efficient LLM Generation
Figure 4 for MagicPIG: LSH Sampling for Efficient LLM Generation
Viaarxiv icon

Agent-as-a-Judge: Evaluate Agents with Agents

Add code
Oct 14, 2024
Figure 1 for Agent-as-a-Judge: Evaluate Agents with Agents
Figure 2 for Agent-as-a-Judge: Evaluate Agents with Agents
Figure 3 for Agent-as-a-Judge: Evaluate Agents with Agents
Figure 4 for Agent-as-a-Judge: Evaluate Agents with Agents
Viaarxiv icon

Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces

Add code
Oct 13, 2024
Viaarxiv icon