Zhe Chen

Swin-X2S: Reconstructing 3D Shape from 2D Biplanar X-ray with Swin Transformers

Jan 10, 2025
Via arXiv

Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications

Jan 05, 2025
Via arXiv

LCFed: An Efficient Clustered Federated Learning Framework for Heterogeneous Data

Jan 03, 2025
Via arXiv

LEO-Split: A Semi-Supervised Split Learning Framework over LEO Satellite Networks

Jan 02, 2025
Via arXiv

Toward Modality Gap: Vision Prototype Learning for Weakly-supervised Semantic Segmentation with CLIP

Dec 27, 2024
Via arXiv

EPE-P: Evidence-based Parameter-efficient Prompting for Multimodal Learning with Missing Modalities

Dec 23, 2024
Via arXiv

HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding

Dec 20, 2024
Via arXiv

PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models

Dec 12, 2024
Via arXiv

Hierarchical Split Federated Learning: Convergence Analysis and System Optimization

Dec 10, 2024
Via arXiv

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Dec 06, 2024
Via arXiv