Picture for Siva Reddy

Siva Reddy

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Add code
Oct 02, 2024
Figure 1 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Figure 2 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Figure 3 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Figure 4 for VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
Viaarxiv icon

Benchmarking Vision Language Models for Cultural Understanding

Add code
Jul 15, 2024
Viaarxiv icon

ROSA: Random Subspace Adaptation for Efficient Fine-Tuning

Add code
Jul 10, 2024
Viaarxiv icon

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

Add code
Jul 03, 2024
Viaarxiv icon

Reframing linguistic bootstrapping as joint inference using visually-grounded grammar induction models

Add code
Jun 17, 2024
Viaarxiv icon

Interpretability Needs a New Paradigm

Add code
May 08, 2024
Viaarxiv icon

Universal Adversarial Triggers Are Not Universal

Add code
Apr 24, 2024
Viaarxiv icon

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Add code
Apr 09, 2024
Viaarxiv icon

Scope Ambiguities in Large Language Models

Add code
Apr 05, 2024
Viaarxiv icon

A Compositional Typed Semantics for Universal Dependencies

Add code
Mar 02, 2024
Viaarxiv icon