Picture for Yaxiong Wang

Yaxiong Wang

Towards Micro-Action Recognition with Limited Annotations: An Asynchronous Pseudo Labeling and Training Approach

Add code
Apr 10, 2025
Viaarxiv icon

Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation

Add code
Mar 31, 2025
Viaarxiv icon

Text-Driven Diffusion Model for Sign Language Production

Add code
Mar 20, 2025
Viaarxiv icon

Knowledge Swapping via Learning and Unlearning

Add code
Feb 12, 2025
Viaarxiv icon

A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization

Add code
Dec 27, 2024
Viaarxiv icon

ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding

Add code
Dec 17, 2024
Figure 1 for ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding
Figure 2 for ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding
Figure 3 for ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding
Figure 4 for ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding
Viaarxiv icon

Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery

Add code
Dec 07, 2024
Figure 1 for Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery
Figure 2 for Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery
Figure 3 for Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery
Figure 4 for Neighborhood Commonality-aware Evolution Network for Continuous Generalized Category Discovery
Viaarxiv icon

Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search

Add code
Nov 26, 2024
Figure 1 for Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search
Figure 2 for Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search
Figure 3 for Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search
Figure 4 for Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search
Viaarxiv icon

EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning

Add code
Oct 23, 2024
Figure 1 for EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning
Figure 2 for EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning
Figure 3 for EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning
Figure 4 for EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning
Viaarxiv icon

Knowledge Adaptation Network for Few-Shot Class-Incremental Learning

Add code
Sep 18, 2024
Figure 1 for Knowledge Adaptation Network for Few-Shot Class-Incremental Learning
Figure 2 for Knowledge Adaptation Network for Few-Shot Class-Incremental Learning
Figure 3 for Knowledge Adaptation Network for Few-Shot Class-Incremental Learning
Figure 4 for Knowledge Adaptation Network for Few-Shot Class-Incremental Learning
Viaarxiv icon