Picture for Sandeep Subramanian

Sandeep Subramanian

Pixtral 12B

Add code
Oct 09, 2024
Figure 1 for Pixtral 12B
Figure 2 for Pixtral 12B
Figure 3 for Pixtral 12B
Figure 4 for Pixtral 12B
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

Nemotron-4 15B Technical Report

Add code
Feb 27, 2024
Viaarxiv icon

Mixtral of Experts

Add code
Jan 08, 2024
Viaarxiv icon

Retrieval meets Long Context Large Language Models

Add code
Oct 04, 2023
Viaarxiv icon

Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation

Add code
Jun 02, 2022
Figure 1 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Figure 2 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Figure 3 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Figure 4 for Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation
Viaarxiv icon

NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21

Add code
Nov 16, 2021
Figure 1 for NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21
Figure 2 for NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21
Figure 3 for NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21
Figure 4 for NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21
Viaarxiv icon

Multi-scale Transformer Language Models

Add code
May 01, 2020
Figure 1 for Multi-scale Transformer Language Models
Figure 2 for Multi-scale Transformer Language Models
Figure 3 for Multi-scale Transformer Language Models
Figure 4 for Multi-scale Transformer Language Models
Viaarxiv icon

On Extractive and Abstractive Neural Document Summarization with Transformer Language Models

Add code
Sep 07, 2019
Figure 1 for On Extractive and Abstractive Neural Document Summarization with Transformer Language Models
Figure 2 for On Extractive and Abstractive Neural Document Summarization with Transformer Language Models
Figure 3 for On Extractive and Abstractive Neural Document Summarization with Transformer Language Models
Figure 4 for On Extractive and Abstractive Neural Document Summarization with Transformer Language Models
Viaarxiv icon

Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study

Add code
Jun 04, 2019
Figure 1 for Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study
Figure 2 for Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study
Figure 3 for Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study
Viaarxiv icon