Picture for Jack Lanchantin

Jack Lanchantin

NaturalThoughts: Selecting and Distilling Reasoning Traces for General Reasoning Tasks

Add code
Jul 02, 2025
Viaarxiv icon

Bridging Offline and Online Reinforcement Learning for LLMs

Add code
Jun 26, 2025
Viaarxiv icon

LLM Pretraining with Continuous Concepts

Add code
Feb 12, 2025
Viaarxiv icon

Diverse Preference Optimization

Add code
Jan 31, 2025
Figure 1 for Diverse Preference Optimization
Figure 2 for Diverse Preference Optimization
Figure 3 for Diverse Preference Optimization
Figure 4 for Diverse Preference Optimization
Viaarxiv icon

Adaptive Decoding via Latent Preference Optimization

Add code
Nov 14, 2024
Figure 1 for Adaptive Decoding via Latent Preference Optimization
Figure 2 for Adaptive Decoding via Latent Preference Optimization
Figure 3 for Adaptive Decoding via Latent Preference Optimization
Figure 4 for Adaptive Decoding via Latent Preference Optimization
Viaarxiv icon

TOOLVERIFIER: Generalization to New Tools via Self-Verification

Add code
Feb 21, 2024
Figure 1 for TOOLVERIFIER: Generalization to New Tools via Self-Verification
Figure 2 for TOOLVERIFIER: Generalization to New Tools via Self-Verification
Figure 3 for TOOLVERIFIER: Generalization to New Tools via Self-Verification
Figure 4 for TOOLVERIFIER: Generalization to New Tools via Self-Verification
Viaarxiv icon

A Data Source for Reasoning Embodied Agents

Add code
Sep 14, 2023
Viaarxiv icon

Learning to Reason and Memorize with Self-Notes

Add code
May 01, 2023
Figure 1 for Learning to Reason and Memorize with Self-Notes
Figure 2 for Learning to Reason and Memorize with Self-Notes
Figure 3 for Learning to Reason and Memorize with Self-Notes
Figure 4 for Learning to Reason and Memorize with Self-Notes
Viaarxiv icon

General Multi-label Image Classification with Transformers

Add code
Nov 27, 2020
Figure 1 for General Multi-label Image Classification with Transformers
Figure 2 for General Multi-label Image Classification with Transformers
Figure 3 for General Multi-label Image Classification with Transformers
Figure 4 for General Multi-label Image Classification with Transformers
Viaarxiv icon

Reevaluating Adversarial Examples in Natural Language

Add code
Apr 25, 2020
Figure 1 for Reevaluating Adversarial Examples in Natural Language
Figure 2 for Reevaluating Adversarial Examples in Natural Language
Figure 3 for Reevaluating Adversarial Examples in Natural Language
Figure 4 for Reevaluating Adversarial Examples in Natural Language
Viaarxiv icon