Picture for Yuxuan Wang

Yuxuan Wang

Sherman

The establishment of static digital humans and the integration with spinal models

Add code
Feb 11, 2025
Viaarxiv icon

DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation

Add code
Feb 06, 2025
Viaarxiv icon

Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation

Add code
Jan 27, 2025
Figure 1 for Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation
Figure 2 for Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation
Figure 3 for Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation
Figure 4 for Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation
Viaarxiv icon

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Add code
Jan 10, 2025
Figure 1 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Figure 2 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Figure 3 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Figure 4 for MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Viaarxiv icon

LongViTU: Instruction Tuning for Long-Form Video Understanding

Add code
Jan 09, 2025
Viaarxiv icon

Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding

Add code
Dec 23, 2024
Viaarxiv icon

CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models

Add code
Dec 13, 2024
Figure 1 for CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models
Figure 2 for CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models
Figure 3 for CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models
Figure 4 for CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models
Viaarxiv icon

Pushing Rendering Boundaries: Hard Gaussian Splatting

Add code
Dec 06, 2024
Viaarxiv icon

SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation

Add code
Nov 27, 2024
Viaarxiv icon

Edge-Assisted Accelerated Cooperative Sensing for CAVs: Task Placement and Resource Allocation

Add code
Nov 27, 2024
Viaarxiv icon