Picture for Li Zhu

Li Zhu

Aligning First, Then Fusing: A Novel Weakly Supervised Multimodal Violence Detection Method

Add code
Jan 13, 2025
Viaarxiv icon

A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization

Add code
Dec 27, 2024
Viaarxiv icon

Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search

Add code
Nov 26, 2024
Figure 1 for Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search
Figure 2 for Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search
Figure 3 for Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search
Figure 4 for Beyond Walking: A Large-Scale Image-Text Benchmark for Text-based Person Anomaly Search
Viaarxiv icon

DOGE: Towards Versatile Visual Document Grounding and Referring

Add code
Nov 26, 2024
Figure 1 for DOGE: Towards Versatile Visual Document Grounding and Referring
Figure 2 for DOGE: Towards Versatile Visual Document Grounding and Referring
Figure 3 for DOGE: Towards Versatile Visual Document Grounding and Referring
Figure 4 for DOGE: Towards Versatile Visual Document Grounding and Referring
Viaarxiv icon

Pseudo-Label Enhanced Prototypical Contrastive Learning for Uniformed Intent Discovery

Add code
Oct 26, 2024
Figure 1 for Pseudo-Label Enhanced Prototypical Contrastive Learning for Uniformed Intent Discovery
Figure 2 for Pseudo-Label Enhanced Prototypical Contrastive Learning for Uniformed Intent Discovery
Figure 3 for Pseudo-Label Enhanced Prototypical Contrastive Learning for Uniformed Intent Discovery
Figure 4 for Pseudo-Label Enhanced Prototypical Contrastive Learning for Uniformed Intent Discovery
Viaarxiv icon

DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition

Add code
Jul 08, 2024
Figure 1 for DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition
Figure 2 for DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition
Figure 3 for DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition
Figure 4 for DMSD-CDFSAR: Distillation from Mixed-Source Domain for Cross-Domain Few-shot Action Recognition
Viaarxiv icon

ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real World

Add code
Jun 19, 2024
Viaarxiv icon

Graph Feedback Bandits with Similar Arms

Add code
May 18, 2024
Viaarxiv icon

Multi-view Distillation based on Multi-modal Fusion for Few-shot Action Recognition

Add code
Jan 16, 2024
Viaarxiv icon

Forced Exploration in Bandit Problems

Add code
Dec 13, 2023
Viaarxiv icon