Picture for Ziang Zhang

Ziang Zhang

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

Add code
Oct 28, 2024
Figure 1 for OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
Figure 2 for OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
Figure 3 for OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
Figure 4 for OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup
Viaarxiv icon

MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization

Add code
Oct 16, 2024
Viaarxiv icon

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Add code
Aug 29, 2024
Figure 1 for WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
Figure 2 for WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
Figure 3 for WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
Figure 4 for WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling
Viaarxiv icon

OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces

Add code
Jul 16, 2024
Viaarxiv icon

RIMformer: An End-to-End Transformer for FMCW Radar Interference Mitigation

Add code
Jul 16, 2024
Viaarxiv icon

FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion

Add code
May 10, 2024
Viaarxiv icon

Molecule-Space: Free Lunch in Unified Multimodal Space via Knowledge Fusion

Add code
May 08, 2024
Viaarxiv icon

ProMamba: Prompt-Mamba for polyp segmentation

Add code
Mar 26, 2024
Viaarxiv icon

A Segmentation Foundation Model for Diverse-type Tumors

Add code
Mar 11, 2024
Viaarxiv icon

Zero-knowledge Proof Meets Machine Learning in Verifiability: A Survey

Add code
Oct 23, 2023
Viaarxiv icon