Autoregressive decoder-only transformers have become key components of scalable sequence processing and generation models. However, the transformer's self-attention mechanism requires transferring the projections of all prior tokens from main memory at each time step (token), which severely limits performance on conventional processors. Self-attention can be viewed as a dynamic feed-forward layer whose weight matrix depends on the input sequence, much like a weight matrix shaped by local synaptic plasticity. Using this insight, we present a neuromorphic decoder-only transformer model that uses an on-chip plasticity processor to compute self-attention. Interestingly, the training of transformers enables them to ``learn'' from the input context during inference. We demonstrate this in-context learning ability of transformers on the Loihi 2 processor by solving a few-shot classification problem. With this, we emphasize the importance of pretrained models, in particular their ability to yield simple, local, backpropagation-free learning rules that enable on-chip learning and adaptation in a hardware-friendly manner.
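The plasticity view admits a minimal sketch, written here in notation we introduce for illustration (the abstract itself does not fix one). In causal self-attention, the output at step $t$ is
\begin{equation*}
o_t \;=\; \sum_{i=1}^{t} \operatorname{softmax}_i\!\left(\frac{q_t^{\top} k_i}{\sqrt{d}}\right) v_i ,
\end{equation*}
i.e., a read-out through a matrix assembled from all past key and value projections $k_i, v_i$, which is what forces the per-token memory transfers. In the linearized (fast-weight) variant, the dynamic feed-forward layer becomes explicit:
\begin{equation*}
o_t \;=\; W_t\,\phi(q_t), \qquad W_t \;=\; W_{t-1} + v_t\,\phi(k_t)^{\top},
\end{equation*}
where $\phi$ is a feature map and the update to $W_t$ is a Hebbian outer-product rule: local and backpropagation-free, and hence the kind of rule an on-chip plasticity processor can apply directly.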