Chen Zhang

SenseTime Research

MiLiC-Eval: Benchmarking Multilingual LLMs for China's Minority Languages

Mar 03, 2025
Via arXiv

Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis

Feb 26, 2025
Via arXiv

KAPPA: A Generic Patent Analysis Framework with Keyphrase-Based Portraits

Feb 18, 2025
Via arXiv

HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation

Feb 10, 2025
Via arXiv

A Survey on Multi-Turn Interaction Capabilities of Large Language Models

Jan 17, 2025
Via arXiv

Data and System Perspectives of Sustainable Artificial Intelligence

Jan 13, 2025
Via arXiv

Dialogue Language Model with Large-Scale Persona Data Engineering

Dec 12, 2024
Via arXiv

ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty

Dec 12, 2024
Via arXiv

Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task

Oct 31, 2024
Via arXiv

FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space

Oct 28, 2024
Via arXiv