Fei Jia

HUAWEI

Automatically Planning Optimal Parallel Strategy for Large Language Models

Dec 31, 2024

Star Attention: Efficient LLM Inference over Long Sequences

Nov 26, 2024

Romanization Encoding For Multilingual ASR

Jul 05, 2024

RULER: What's the Real Context Size of Your Long-Context Language Models?

Apr 11, 2024

Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition

Apr 04, 2024

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Feb 15, 2024

Efficient Sequence Transduction by Jointly Predicting Tokens and Durations

Apr 13, 2023

Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models

Nov 09, 2022

Multi-blank Transducers for Speech Recognition

Nov 04, 2022

AmberNet: A Compact End-to-End Model for Spoken Language Identification

Oct 27, 2022