Prajjwal Bhargava

Jack

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026

Let's (not) just put things in Context: Test-Time Training for Long-Context LLMs

Dec 15, 2025

Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks

Feb 24, 2025

The Llama 3 Herd of Models

Jul 31, 2024

Effective Long-Context Scaling of Foundation Models

Sep 27, 2023

Llama 2: Open Foundation and Fine-Tuned Chat Models

Jul 19, 2023

Sequence Modeling is a Robust Contender for Offline Reinforcement Learning

May 26, 2023

AUTODIAL: Efficient Asynchronous Task-Oriented Dialogue Model

Mar 10, 2023

DiscoSense: Commonsense Reasoning with Discourse Connectives

Oct 22, 2022

Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey

Jan 28, 2022