Zhuo Chen

refer to the report for detailed contributions

K-ON: Stacking Knowledge On the Head Layer of Large Language Model

Feb 10, 2025

DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation

Feb 06, 2025

Topic-FlipRAG: Topic-Orientated Adversarial Opinion Manipulation Attacks to Retrieval-Augmented Generation Models

Feb 03, 2025

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Jan 21, 2025

Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model

Jan 13, 2025

FlipedRAG: Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models

Jan 06, 2025

Have We Designed Generalizable Structural Knowledge Promptings? Systematic Evaluation and Rethinking

Dec 31, 2024

Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation

Dec 19, 2024

Iterative Camera-LiDAR Extrinsic Optimization via Surrogate Diffusion

Nov 17, 2024

CMATH: Cross-Modality Augmented Transformer with Hierarchical Variational Distillation for Multimodal Emotion Recognition in Conversation

Nov 15, 2024