Picture for Jiajun Deng

Jiajun Deng

ARCHI-TTS: A flow-matching-based Text-to-Speech Model with Self-supervised Semantic Aligner and Accelerated Inference

Add code
Feb 05, 2026
Viaarxiv icon

Environment-Aware Adaptive Pruning with Interleaved Inference Orchestration for Vision-Language-Action Models

Add code
Jan 31, 2026
Viaarxiv icon

BookNet: Book Image Rectification via Cross-Page Attention Network

Add code
Jan 29, 2026
Viaarxiv icon

LISN: Language-Instructed Social Navigation with VLM-based Controller Modulating

Add code
Dec 10, 2025
Viaarxiv icon

CLEAR: Continuous Latent Autoregressive Modeling for High-quality and Low-latency Speech Synthesis

Add code
Aug 26, 2025
Viaarxiv icon

VLMPlanner: Integrating Visual Language Models with Motion Planning

Add code
Jul 27, 2025
Viaarxiv icon

MOPSA: Mixture of Prompt-Experts Based Speaker Adaptation for Elderly Speech Recognition

Add code
May 30, 2025
Figure 1 for MOPSA: Mixture of Prompt-Experts Based Speaker Adaptation for Elderly Speech Recognition
Figure 2 for MOPSA: Mixture of Prompt-Experts Based Speaker Adaptation for Elderly Speech Recognition
Figure 3 for MOPSA: Mixture of Prompt-Experts Based Speaker Adaptation for Elderly Speech Recognition
Figure 4 for MOPSA: Mixture of Prompt-Experts Based Speaker Adaptation for Elderly Speech Recognition
Viaarxiv icon

SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images

Add code
May 29, 2025
Viaarxiv icon

On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition

Add code
May 28, 2025
Figure 1 for On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition
Figure 2 for On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition
Figure 3 for On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition
Figure 4 for On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition
Viaarxiv icon

Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots

Add code
May 26, 2025
Viaarxiv icon