Picture for Bailin Wang

Bailin Wang

Massachusetts Institute of Technology

Gated Slot Attention for Efficient Linear-Time Sequence Modeling

Add code
Sep 11, 2024
Figure 1 for Gated Slot Attention for Efficient Linear-Time Sequence Modeling
Figure 2 for Gated Slot Attention for Efficient Linear-Time Sequence Modeling
Figure 3 for Gated Slot Attention for Efficient Linear-Time Sequence Modeling
Figure 4 for Gated Slot Attention for Efficient Linear-Time Sequence Modeling
Viaarxiv icon

MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models

Add code
Jun 20, 2024
Figure 1 for MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
Figure 2 for MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
Figure 3 for MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
Figure 4 for MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models
Viaarxiv icon

Parallelizing Linear Transformers with the Delta Rule over Sequence Length

Add code
Jun 10, 2024
Viaarxiv icon

Language Model Evolution: An Iterated Learning Perspective

Add code
Apr 04, 2024
Figure 1 for Language Model Evolution: An Iterated Learning Perspective
Figure 2 for Language Model Evolution: An Iterated Learning Perspective
Figure 3 for Language Model Evolution: An Iterated Learning Perspective
Figure 4 for Language Model Evolution: An Iterated Learning Perspective
Viaarxiv icon

Learning to Decode Collaboratively with Multiple Language Models

Add code
Mar 06, 2024
Figure 1 for Learning to Decode Collaboratively with Multiple Language Models
Figure 2 for Learning to Decode Collaboratively with Multiple Language Models
Figure 3 for Learning to Decode Collaboratively with Multiple Language Models
Figure 4 for Learning to Decode Collaboratively with Multiple Language Models
Viaarxiv icon

In-Context Language Learning: Architectures and Algorithms

Add code
Jan 30, 2024
Viaarxiv icon

Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models

Add code
Jan 19, 2024
Viaarxiv icon

Gated Linear Attention Transformers with Hardware-Efficient Training

Add code
Dec 24, 2023
Viaarxiv icon

Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations

Add code
Nov 13, 2023
Figure 1 for Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations
Figure 2 for Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations
Figure 3 for Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations
Figure 4 for Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations
Viaarxiv icon

An Investigation of LLMs' Inefficacy in Understanding Converse Relations

Add code
Oct 25, 2023
Viaarxiv icon