Picture for Chen Zhang

Chen Zhang

SenseTime Research

Dialogue Language Model with Large-Scale Persona Data Engineering

Add code
Dec 12, 2024
Viaarxiv icon

ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty

Add code
Dec 12, 2024
Viaarxiv icon

Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task

Add code
Oct 31, 2024
Figure 1 for Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task
Figure 2 for Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task
Figure 3 for Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task
Figure 4 for Technical Report for SoccerNet Challenge 2022 -- Replay Grounding Task
Viaarxiv icon

FreqMark: Invisible Image Watermarking via Frequency Based Optimization in Latent Space

Add code
Oct 28, 2024
Viaarxiv icon

VoiceBench: Benchmarking LLM-Based Voice Assistants

Add code
Oct 22, 2024
Viaarxiv icon

Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification

Add code
Oct 22, 2024
Figure 1 for Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification
Figure 2 for Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification
Figure 3 for Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification
Figure 4 for Fast Graph Sharpness-Aware Minimization for Enhancing and Accelerating Few-Shot Node Classification
Viaarxiv icon

MoDification: Mixture of Depths Made Easy

Add code
Oct 18, 2024
Figure 1 for MoDification: Mixture of Depths Made Easy
Figure 2 for MoDification: Mixture of Depths Made Easy
Figure 3 for MoDification: Mixture of Depths Made Easy
Figure 4 for MoDification: Mixture of Depths Made Easy
Viaarxiv icon

MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes

Add code
Oct 09, 2024
Figure 1 for MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Figure 2 for MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Figure 3 for MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Figure 4 for MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
Viaarxiv icon

Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning

Add code
Oct 07, 2024
Viaarxiv icon

Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models

Add code
Sep 27, 2024
Figure 1 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 2 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 3 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Figure 4 for Beyond Single-Audio: Advancing Multi-Audio Processing in Audio Large Language Models
Viaarxiv icon