Chen Zhang

SenseTime Research

A Survey on LLM Mid-training

Oct 27, 2025

Doc-Researcher: A Unified System for Multimodal Document Parsing and Deep Research

Oct 24, 2025

MG2FlowNet: Accelerating High-Reward Sample Generation via Enhanced MCTS and Greediness Control

Oct 01, 2025

FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression

Oct 01, 2025

ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive

Aug 26, 2025

Few-shot Unknown Class Discovery of Hyperspectral Images with Prototype Learning and Clustering

Aug 25, 2025

ZigzagAttention: Efficient Long-Context Inference with Exclusive Retrieval and Streaming Heads

Aug 17, 2025

AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation

Aug 01, 2025

Hypernetworks for Model-Heterogeneous Personalized Federated Learning

Jul 30, 2025

AC-Refiner: Efficient Arithmetic Circuit Optimization Using Conditional Diffusion Models

Jul 03, 2025