Picture for Jingang Wang

Jingang Wang

SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity

Add code
Mar 03, 2025
Viaarxiv icon

Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective

Add code
Feb 20, 2025
Viaarxiv icon

FRAMES: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy

Add code
Feb 08, 2025
Viaarxiv icon

FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training

Add code
Feb 02, 2025
Viaarxiv icon

AgentRefine: Enhancing Agent Generalization through Refinement Tuning

Add code
Jan 03, 2025
Viaarxiv icon

Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern

Add code
Dec 06, 2024
Viaarxiv icon

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Add code
Nov 25, 2024
Figure 1 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 2 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 3 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Figure 4 for Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision
Viaarxiv icon

Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning

Add code
Nov 05, 2024
Viaarxiv icon

Multi-Programming Language Sandbox for LLMs

Add code
Oct 30, 2024
Figure 1 for Multi-Programming Language Sandbox for LLMs
Figure 2 for Multi-Programming Language Sandbox for LLMs
Figure 3 for Multi-Programming Language Sandbox for LLMs
Figure 4 for Multi-Programming Language Sandbox for LLMs
Viaarxiv icon

FIRP: Faster LLM inference via future intermediate representation prediction

Add code
Oct 27, 2024
Viaarxiv icon