Picture for Vahid Noroozi

Vahid Noroozi

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Add code
Apr 10, 2025
Viaarxiv icon

OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs

Add code
Apr 05, 2025
Viaarxiv icon

OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

Add code
Apr 02, 2025
Viaarxiv icon

Scoring Verifiers: Evaluating Synthetic Verification in Code and Reasoning

Add code
Feb 19, 2025
Viaarxiv icon

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Figure 1 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 2 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 3 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 4 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Viaarxiv icon

Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models

Add code
Jul 29, 2024
Figure 1 for Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
Figure 2 for Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
Figure 3 for Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
Figure 4 for Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
Viaarxiv icon

Instruction Data Generation and Unsupervised Adaptation for Speech Language Models

Add code
Jun 18, 2024
Figure 1 for Instruction Data Generation and Unsupervised Adaptation for Speech Language Models
Figure 2 for Instruction Data Generation and Unsupervised Adaptation for Speech Language Models
Figure 3 for Instruction Data Generation and Unsupervised Adaptation for Speech Language Models
Figure 4 for Instruction Data Generation and Unsupervised Adaptation for Speech Language Models
Viaarxiv icon

Nemotron-4 340B Technical Report

Add code
Jun 17, 2024
Figure 1 for Nemotron-4 340B Technical Report
Figure 2 for Nemotron-4 340B Technical Report
Figure 3 for Nemotron-4 340B Technical Report
Figure 4 for Nemotron-4 340B Technical Report
Viaarxiv icon

Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition

Add code
Jan 11, 2024
Viaarxiv icon

Investigating End-to-End ASR Architectures for Long Form Audio Transcription

Add code
Sep 20, 2023
Viaarxiv icon