Picture for Shunsi Zhang

Shunsi Zhang

FlexGen: Flexible Multi-View Generation from Text and Image Inputs

Add code
Oct 14, 2024
Viaarxiv icon

MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer

Add code
Sep 01, 2024
Viaarxiv icon

Adaptive-avg-pooling based Attention Vision Transformer for Face Anti-spoofing

Add code
Jan 10, 2024
Viaarxiv icon