Picture for Gopeshh Subbaraj

Gopeshh Subbaraj

Enabling Realtime Reinforcement Learning at Scale with Staggered Asynchronous Inference

Add code
Dec 18, 2024
Viaarxiv icon

Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning

Add code
Nov 04, 2024
Figure 1 for Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Figure 2 for Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Figure 3 for Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Figure 4 for Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
Viaarxiv icon

GFlowNet Pretraining with Inexpensive Rewards

Add code
Sep 15, 2024
Viaarxiv icon

Continual Learning In Environments With Polynomial Mixing Times

Add code
Dec 13, 2021
Figure 1 for Continual Learning In Environments With Polynomial Mixing Times
Figure 2 for Continual Learning In Environments With Polynomial Mixing Times
Figure 3 for Continual Learning In Environments With Polynomial Mixing Times
Figure 4 for Continual Learning In Environments With Polynomial Mixing Times
Viaarxiv icon