Picture for Shansan Gong

Shansan Gong

Why Does the Effective Context Length of LLMs Fall Short?

Add code
Oct 24, 2024
Viaarxiv icon

Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Add code
Oct 23, 2024
Viaarxiv icon

Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning

Add code
Oct 18, 2024
Viaarxiv icon

Training-Free Long-Context Scaling of Large Language Models

Add code
Feb 27, 2024
Viaarxiv icon

BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models

Add code
Feb 21, 2024
Viaarxiv icon

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

Add code
Feb 12, 2024
Viaarxiv icon

DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models

Add code
Oct 16, 2023
Figure 1 for DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models
Figure 2 for DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models
Figure 3 for DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models
Figure 4 for DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models
Viaarxiv icon

L-Eval: Instituting Standardized Evaluation for Long Context Language Models

Add code
Jul 31, 2023
Viaarxiv icon

In-Context Learning with Many Demonstration Examples

Add code
Feb 09, 2023
Viaarxiv icon

DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

Add code
Oct 17, 2022
Figure 1 for DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Figure 2 for DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Figure 3 for DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Figure 4 for DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models
Viaarxiv icon