Shansan Gong

Why Does the Effective Context Length of LLMs Fall Short?

Oct 24, 2024
Via arXiv

Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Oct 23, 2024
Via arXiv

Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning

Oct 18, 2024
Via arXiv

Training-Free Long-Context Scaling of Large Language Models

Feb 27, 2024
Via arXiv

BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models

Feb 21, 2024
Via arXiv

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

Feb 12, 2024
Via arXiv

DiffuSeq-v2: Bridging Discrete and Continuous Text Spaces for Accelerated Seq2Seq Diffusion Models

Oct 16, 2023
Via arXiv

L-Eval: Instituting Standardized Evaluation for Long Context Language Models

Jul 31, 2023
Via arXiv

In-Context Learning with Many Demonstration Examples

Feb 09, 2023
Via arXiv

DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

Oct 17, 2022
Via arXiv