Picture for Junlin Wang

Junlin Wang

How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning

Add code
May 30, 2025
Viaarxiv icon

Grounding Bodily Awareness in Visual Representations for Efficient Policy Learning

Add code
May 24, 2025
Viaarxiv icon

Atomic Consistency Preference Optimization for Long-Form Question Answering

Add code
May 14, 2025
Viaarxiv icon

Improving Model Alignment Through Collective Intelligence of Open-Source LLMS

Add code
May 05, 2025
Viaarxiv icon

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods

Add code
Apr 18, 2025
Viaarxiv icon

Knowing When to Stop: Dynamic Context Cutoff for Large Language Models

Add code
Feb 03, 2025
Figure 1 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Figure 2 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Figure 3 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Figure 4 for Knowing When to Stop: Dynamic Context Cutoff for Large Language Models
Viaarxiv icon

A Fully Parameter-Free Second-Order Algorithm for Convex-Concave Minimax Problems with Optimal Iteration Complexity

Add code
Jul 04, 2024
Figure 1 for A Fully Parameter-Free Second-Order Algorithm for Convex-Concave Minimax Problems with Optimal Iteration Complexity
Viaarxiv icon

ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods

Add code
Jun 23, 2024
Figure 1 for ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods
Figure 2 for ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods
Figure 3 for ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods
Figure 4 for ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods
Viaarxiv icon

Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies

Add code
Jun 11, 2024
Figure 1 for Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Figure 2 for Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Figure 3 for Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Figure 4 for Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies
Viaarxiv icon

Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications

Add code
Jun 10, 2024
Viaarxiv icon