Di Zhang

Be a Multitude to Itself: A Prompt Evolution Framework for Red Teaming

Feb 22, 2025
Via arXiv

SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin

Feb 19, 2025
Via arXiv

FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems

Feb 19, 2025
Via arXiv

iMOVE: Instance-Motion-Aware Video Understanding

Feb 18, 2025
Via arXiv

DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs

Feb 18, 2025
Via arXiv

Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts

Feb 18, 2025
Via arXiv

VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation

Feb 18, 2025
Via arXiv

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

Feb 12, 2025
Via arXiv

Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization

Feb 03, 2025
Via arXiv

Improving Video Generation with Human Feedback

Jan 23, 2025
Via arXiv