Picture for Ziyao Zhang

Ziyao Zhang

ApET: Approximation-Error Guided Token Compression for Efficient VLMs

Add code
Feb 23, 2026
Viaarxiv icon

Multimodal Priors-Augmented Text-Driven 3D Human-Object Interaction Generation

Add code
Feb 11, 2026
Viaarxiv icon

Unseen Speaker and Language Adaptation for Lightweight Text-To-Speech with Adapters

Add code
Aug 25, 2025
Viaarxiv icon

Fg-T2M++: LLMs-Augmented Fine-Grained Text Driven Human Motion Generation

Add code
Feb 08, 2025
Viaarxiv icon

LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation

Add code
Sep 30, 2024
Figure 1 for LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation
Figure 2 for LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation
Figure 3 for LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation
Figure 4 for LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation
Viaarxiv icon

Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech

Add code
Sep 15, 2023
Viaarxiv icon

First Glance Diagnosis: Brain Disease Classification with Single fMRI Volume

Add code
Aug 10, 2022
Figure 1 for First Glance Diagnosis: Brain Disease Classification with Single fMRI Volume
Figure 2 for First Glance Diagnosis: Brain Disease Classification with Single fMRI Volume
Figure 3 for First Glance Diagnosis: Brain Disease Classification with Single fMRI Volume
Figure 4 for First Glance Diagnosis: Brain Disease Classification with Single fMRI Volume
Viaarxiv icon

Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)

Add code
Jul 04, 2022
Figure 1 for Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)
Figure 2 for Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)
Figure 3 for Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)
Figure 4 for Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)
Viaarxiv icon

Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech

Add code
Jul 04, 2022
Figure 1 for Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech
Figure 2 for Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech
Figure 3 for Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech
Figure 4 for Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech
Viaarxiv icon

State Action Separable Reinforcement Learning

Add code
Jun 05, 2020
Figure 1 for State Action Separable Reinforcement Learning
Figure 2 for State Action Separable Reinforcement Learning
Figure 3 for State Action Separable Reinforcement Learning
Viaarxiv icon