Picture for Ke Zhang

Ke Zhang

Senior Member, IEEE

HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss

Add code
Apr 10, 2025
Viaarxiv icon

MedCL: Learning Consistent Anatomy Distribution for Scribble-supervised Medical Image Segmentation

Add code
Mar 28, 2025
Viaarxiv icon

CDKFormer: Contextual Deviation Knowledge-Based Transformer for Long-Tail Trajectory Prediction

Add code
Mar 16, 2025
Viaarxiv icon

A Survey on Foundation-Model-Based Industrial Defect Detection

Add code
Feb 26, 2025
Viaarxiv icon

IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning

Add code
Feb 04, 2025
Viaarxiv icon

Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction

Add code
Jan 19, 2025
Figure 1 for Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction
Figure 2 for Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction
Figure 3 for Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction
Figure 4 for Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction
Viaarxiv icon

Distributed satellite information networks: Architecture, enabling technologies, and trends

Add code
Dec 17, 2024
Figure 1 for Distributed satellite information networks: Architecture, enabling technologies, and trends
Figure 2 for Distributed satellite information networks: Architecture, enabling technologies, and trends
Figure 3 for Distributed satellite information networks: Architecture, enabling technologies, and trends
Figure 4 for Distributed satellite information networks: Architecture, enabling technologies, and trends
Viaarxiv icon

V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations

Add code
Dec 16, 2024
Viaarxiv icon

MoMuSE: Momentum Multi-modal Target Speaker Extraction for Real-time Scenarios with Impaired Visual Cues

Add code
Dec 11, 2024
Viaarxiv icon

TL-CLIP: A Power-specific Multimodal Pre-trained Visual Foundation Model for Transmission Line Defect Recognition

Add code
Nov 18, 2024
Figure 1 for TL-CLIP: A Power-specific Multimodal Pre-trained Visual Foundation Model for Transmission Line Defect Recognition
Figure 2 for TL-CLIP: A Power-specific Multimodal Pre-trained Visual Foundation Model for Transmission Line Defect Recognition
Figure 3 for TL-CLIP: A Power-specific Multimodal Pre-trained Visual Foundation Model for Transmission Line Defect Recognition
Figure 4 for TL-CLIP: A Power-specific Multimodal Pre-trained Visual Foundation Model for Transmission Line Defect Recognition
Viaarxiv icon