Jingang Wang

Ltri-LLM: Streaming Long Context Inference for LLMs with Training-Free Dynamic Triangular Attention Pattern

Dec 06, 2024
Via arXiv

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Nov 25, 2024
Via arXiv

Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning

Nov 05, 2024
Via arXiv

Multi-Programming Language Sandbox for LLMs

Oct 30, 2024
Via arXiv

FIRP: Faster LLM inference via future intermediate representation prediction

Oct 27, 2024
Via arXiv

Let's Ask GNN: Empowering Large Language Model for Graph In-Context Learning

Oct 09, 2024
Via arXiv

Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models

Oct 08, 2024
Via arXiv

Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-context Models

Oct 07, 2024
Via arXiv

Length Desensitization in Directed Preference Optimization

Sep 10, 2024
Via arXiv

How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data

Sep 05, 2024
Via arXiv