Ting-Han Fan

Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation

Nov 15, 2023
Via arXiv

Advancing Regular Language Reasoning in Linear Recurrent Neural Networks

Sep 14, 2023
Via arXiv

Latent Positional Information is in the Self-Attention Variance of Transformer Language Models Without Positional Embeddings

May 23, 2023
Via arXiv

Transformer Working Memory Enables Regular Language Reasoning and Natural Language Length Extrapolation

May 05, 2023
Via arXiv

Receptive Field Alignment Enables Transformer Length Extrapolation

Dec 20, 2022
Via arXiv

Training Discrete Deep Generative Models via Gapped Straight-Through Estimator

Jun 15, 2022
Via arXiv

KERPLE: Kernelized Relative Positional Embedding for Length Extrapolation

May 20, 2022
Via arXiv

Explaining Off-Policy Actor-Critic From A Bias-Variance Perspective

Oct 06, 2021
Via arXiv

PowerGym: A Reinforcement Learning Environment for Volt-Var Control in Power Distribution Systems

Sep 20, 2021
Via arXiv

Soft Actor-Critic With Integer Actions

Sep 17, 2021
Via arXiv