Jingang Wang

FRAMES: Boosting LLMs with A Four-Quadrant Multi-Stage Pretraining Strategy

Feb 08, 2025
Via arXiv

FIRE: Flexible Integration of Data Quality Ratings for Effective Pre-Training

Feb 02, 2025
Via arXiv

AgentRefine: Enhancing Agent Generalization through Refinement Tuning

Jan 03, 2025
Via arXiv

Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern

Dec 06, 2024
Via arXiv

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Nov 25, 2024
Via arXiv

Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning

Nov 05, 2024
Via arXiv

Multi-Programming Language Sandbox for LLMs

Oct 30, 2024
Via arXiv

FIRP: Faster LLM inference via future intermediate representation prediction

Oct 27, 2024
Via arXiv

Let's Ask GNN: Empowering Large Language Model for Graph In-Context Learning

Oct 09, 2024
Via arXiv

Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models

Oct 08, 2024
Via arXiv