Vahid Noroozi

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Nov 08, 2024

Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models

Jul 29, 2024

Instruction Data Generation and Unsupervised Adaptation for Speech Language Models

Jun 18, 2024

Nemotron-4 340B Technical Report

Jun 17, 2024

Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition

Jan 11, 2024

Investigating End-to-End ASR Architectures for Long Form Audio Transcription

Sep 20, 2023

Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition

May 19, 2023

SGD-QA: Fast Schema-Guided Dialogue State Tracking for Unseen Services

May 17, 2021

SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition

Apr 06, 2021

Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition

Apr 05, 2021