Picture for Xu Zhang

Xu Zhang

GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation

Add code
Oct 02, 2025
Viaarxiv icon

SimCroP: Radiograph Representation Learning with Similarity-driven Cross-granularity Pre-training

Add code
Sep 10, 2025
Viaarxiv icon

Incorporating Legal Logic into Deep Learning: An Intelligent Approach to Probation Prediction

Add code
Aug 17, 2025
Viaarxiv icon

Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference

Add code
Jul 02, 2025
Viaarxiv icon

PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation

Add code
Jun 17, 2025
Viaarxiv icon

A General Knowledge Injection Framework for ICD Coding

Add code
May 24, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion

Add code
May 13, 2025
Figure 1 for Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion
Figure 2 for Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion
Figure 3 for Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion
Figure 4 for Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion
Viaarxiv icon

Platonic Grounding for Efficient Multimodal Language Models

Add code
Apr 27, 2025
Figure 1 for Platonic Grounding for Efficient Multimodal Language Models
Figure 2 for Platonic Grounding for Efficient Multimodal Language Models
Figure 3 for Platonic Grounding for Efficient Multimodal Language Models
Figure 4 for Platonic Grounding for Efficient Multimodal Language Models
Viaarxiv icon

C-FAITH: A Chinese Fine-Grained Benchmark for Automated Hallucination Evaluation

Add code
Apr 14, 2025
Viaarxiv icon