Xiaoheng Jiang

CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation

Nov 21, 2024
Via arXiv

Few-Shot Object Detection with Sparse Context Transformers

Feb 14, 2024
Via arXiv

Joint Attention-Guided Feature Fusion Network for Saliency Detection of Surface Defects

Feb 05, 2024
Via arXiv

ID-like Prompt Learning for Few-Shot Out-of-Distribution Detection

Nov 28, 2023
Via arXiv

Decision Fusion Network with Perception Fine-tuning for Defect Classification

Sep 22, 2023
Via arXiv

CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation

Sep 22, 2023
Via arXiv

Global Context Aggregation Network for Lightweight Saliency Detection of Surface Defects

Sep 22, 2023
Via arXiv

Focal and Global Spatial-Temporal Transformer for Skeleton-based Action Recognition

Oct 06, 2022
Via arXiv

Multi-scale Feature Aggregation for Crowd Counting

Aug 11, 2022
Via arXiv

User-Guided Personalized Image Aesthetic Assessment based on Deep Reinforcement Learning

Jun 14, 2021
Via arXiv