Picture for Xiaoshan Yang

Xiaoshan Yang

OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling

Add code
Oct 10, 2024
Viaarxiv icon

A Comprehensive Review of Few-shot Action Recognition

Add code
Jul 20, 2024
Viaarxiv icon

Libra: Building Decoupled Vision System on Large Language Models

Add code
May 16, 2024
Viaarxiv icon

HiVG: Hierarchical Multimodal Fine-grained Modulation for Visual Grounding

Add code
Apr 20, 2024
Viaarxiv icon

Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection

Add code
Aug 30, 2023
Viaarxiv icon

Multi-modal Queried Object Detection in the Wild

Add code
May 30, 2023
Viaarxiv icon

CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding

Add code
May 15, 2023
Viaarxiv icon

SgVA-CLIP: Semantic-guided Visual Adapting of Vision-Language Models for Few-shot Image Classification

Add code
Nov 28, 2022
Viaarxiv icon

Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding

Add code
Mar 29, 2022
Figure 1 for Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding
Figure 2 for Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding
Figure 3 for Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding
Figure 4 for Shifting More Attention to Visual Backbone: Query-modulated Refinement Networks for End-to-End Visual Grounding
Viaarxiv icon

Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition

Add code
Dec 20, 2021
Figure 1 for Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition
Figure 2 for Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition
Figure 3 for Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition
Figure 4 for Dynamic Hypergraph Convolutional Networks for Skeleton-Based Action Recognition
Viaarxiv icon