Picture for Shubham Toshniwal

Shubham Toshniwal

NVIDIA

IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark

Add code
Nov 12, 2024
Figure 1 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Figure 2 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Figure 3 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Figure 4 for IdentifyMe: A Challenging Long-Context Mention Resolution Benchmark
Viaarxiv icon

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Add code
Oct 02, 2024
Viaarxiv icon

Major Entity Identification: A Generalizable Alternative to Coreference Resolution

Add code
Jun 20, 2024
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

Code Pretraining Improves Entity Tracking Abilities of Language Models

Add code
May 31, 2024
Viaarxiv icon

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Add code
Feb 15, 2024
Figure 1 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Figure 2 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Figure 3 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Figure 4 for OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset
Viaarxiv icon

Learning to Reason and Memorize with Self-Notes

Add code
May 01, 2023
Figure 1 for Learning to Reason and Memorize with Self-Notes
Figure 2 for Learning to Reason and Memorize with Self-Notes
Figure 3 for Learning to Reason and Memorize with Self-Notes
Figure 4 for Learning to Reason and Memorize with Self-Notes
Viaarxiv icon

Adapting Pretrained Text-to-Text Models for Long Text Sequences

Add code
Sep 21, 2022
Figure 1 for Adapting Pretrained Text-to-Text Models for Long Text Sequences
Figure 2 for Adapting Pretrained Text-to-Text Models for Long Text Sequences
Figure 3 for Adapting Pretrained Text-to-Text Models for Long Text Sequences
Figure 4 for Adapting Pretrained Text-to-Text Models for Long Text Sequences
Viaarxiv icon

Efficient and Interpretable Neural Models for Entity Tracking

Add code
Aug 30, 2022
Figure 1 for Efficient and Interpretable Neural Models for Entity Tracking
Figure 2 for Efficient and Interpretable Neural Models for Entity Tracking
Figure 3 for Efficient and Interpretable Neural Models for Entity Tracking
Figure 4 for Efficient and Interpretable Neural Models for Entity Tracking
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon