Picture for Xiong Wang

Xiong Wang

Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Add code
Nov 01, 2024
Viaarxiv icon

A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition

Add code
Aug 18, 2024
Viaarxiv icon

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Add code
Aug 09, 2024
Figure 1 for VITA: Towards Open-Source Interactive Omni Multimodal LLM
Figure 2 for VITA: Towards Open-Source Interactive Omni Multimodal LLM
Figure 3 for VITA: Towards Open-Source Interactive Omni Multimodal LLM
Figure 4 for VITA: Towards Open-Source Interactive Omni Multimodal LLM
Viaarxiv icon

Interacting Particle Systems on Networks: joint inference of the network and the interaction kernel

Add code
Feb 13, 2024
Figure 1 for Interacting Particle Systems on Networks: joint inference of the network and the interaction kernel
Figure 2 for Interacting Particle Systems on Networks: joint inference of the network and the interaction kernel
Figure 3 for Interacting Particle Systems on Networks: joint inference of the network and the interaction kernel
Figure 4 for Interacting Particle Systems on Networks: joint inference of the network and the interaction kernel
Viaarxiv icon

Optimal minimax rate of learning interaction kernels

Add code
Nov 28, 2023
Viaarxiv icon

FedSN: A General Federated Learning Framework over LEO Satellite Networks

Add code
Nov 02, 2023
Viaarxiv icon

Fusing Monocular Images and Sparse IMU Signals for Real-time Human Motion Capture

Add code
Sep 01, 2023
Viaarxiv icon

DCCRN-KWS: an audio bias based model for noise robust small-footprint keyword spotting

Add code
May 23, 2023
Viaarxiv icon

Two Stage Contextual Word Filtering for Context bias in Unified Streaming and Non-streaming Transducer

Add code
Jan 17, 2023
Viaarxiv icon

A Data-Adaptive Prior for Bayesian Learning of Kernels in Operators

Add code
Dec 29, 2022
Viaarxiv icon