Jin Xu

Pelican-Unified 1.0: A Unified Embodied Intelligence Model for Understanding, Reasoning, Imagination and Action

May 14, 2026
Via arXiv

Model Forensics in AI-Native Wireless Networks: Taxonomy, Applications, and Case Study

May 14, 2026
Via arXiv

Nemotron 3 Nano Omni: Efficient and Open Multimodal Intelligence

Apr 27, 2026
Via arXiv

DT2IT-MRM: Debiased Preference Construction and Iterative Training for Multimodal Reward Modeling

Apr 21, 2026
Via arXiv

A Synonymous Variational Perspective on the Rate-Distortion-Perception Tradeoff

Apr 16, 2026
Via arXiv

Semantic-Aware Interruption Detection in Spoken Dialogue Systems: Benchmark, Metric, and Model

Mar 25, 2026
Via arXiv

The Interspeech 2026 Audio Reasoning Challenge: Evaluating Reasoning Process Quality for Audio Reasoning Models and Agents

Feb 15, 2026
Via arXiv

Qwen3-ASR Technical Report

Jan 29, 2026
Via arXiv

LLM-ForcedAligner: A Non-Autoregressive and Accurate LLM-Based Forced Aligner for Multilingual and Long-Form Speech

Jan 26, 2026
Via arXiv

Qwen3-TTS Technical Report

Jan 22, 2026
Via arXiv