Lechao Cheng

Zhejiang Lab

ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding

Dec 17, 2024

Discrete to Continuous: Generating Smooth Transition Poses from Sign Language Observation

Nov 25, 2024

Modality Alignment Meets Federated Broadcasting

Nov 24, 2024

FoPru: Focal Pruning for Efficient Large Vision-Language Models

Nov 21, 2024

Dataset Distillers Are Good Label Denoisers In the Wild

Nov 18, 2024

Diffusion-based Layer-wise Semantic Reconstruction for Unsupervised Out-of-Distribution Detection

Nov 16, 2024

EntityCLIP: Entity-Centric Image-Text Matching via Multimodal Attentive Contrastive Learning

Oct 23, 2024

Benchmarking Multi-Scene Fire and Smoke Detection

Oct 22, 2024

Fire and Smoke Detection with Burning Intensity Representation

Oct 22, 2024

Shaping a Stabilized Video by Mitigating Unintended Changes for Concept-Augmented Video Editing

Oct 16, 2024