Picture for Lei Zhu

Lei Zhu

Provable Ordering and Continuity in Vision-Language Pretraining for Generalizable Embodied Agents

Add code
Feb 03, 2025
Viaarxiv icon

Learning Semantic Facial Descriptors for Accurate Face Animation

Add code
Jan 29, 2025
Viaarxiv icon

V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer

Add code
Jan 09, 2025
Figure 1 for V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Figure 2 for V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Figure 3 for V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Figure 4 for V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Viaarxiv icon

CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model

Add code
Dec 05, 2024
Figure 1 for CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model
Figure 2 for CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model
Figure 3 for CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model
Figure 4 for CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model
Viaarxiv icon

DRIVE: Dual-Robustness via Information Variability and Entropic Consistency in Source-Free Unsupervised Domain Adaptation

Add code
Nov 24, 2024
Figure 1 for DRIVE: Dual-Robustness via Information Variability and Entropic Consistency in Source-Free Unsupervised Domain Adaptation
Figure 2 for DRIVE: Dual-Robustness via Information Variability and Entropic Consistency in Source-Free Unsupervised Domain Adaptation
Figure 3 for DRIVE: Dual-Robustness via Information Variability and Entropic Consistency in Source-Free Unsupervised Domain Adaptation
Figure 4 for DRIVE: Dual-Robustness via Information Variability and Entropic Consistency in Source-Free Unsupervised Domain Adaptation
Viaarxiv icon

Revisiting the Integration of Convolution and Attention for Vision Backbone

Add code
Nov 21, 2024
Viaarxiv icon

Federated Domain Generalization via Prompt Learning and Aggregation

Add code
Nov 15, 2024
Figure 1 for Federated Domain Generalization via Prompt Learning and Aggregation
Figure 2 for Federated Domain Generalization via Prompt Learning and Aggregation
Figure 3 for Federated Domain Generalization via Prompt Learning and Aggregation
Figure 4 for Federated Domain Generalization via Prompt Learning and Aggregation
Viaarxiv icon

UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation

Add code
Nov 13, 2024
Figure 1 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Figure 2 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Figure 3 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Figure 4 for UIFormer: A Unified Transformer-based Framework for Incremental Few-Shot Object Detection and Instance Segmentation
Viaarxiv icon

Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?

Add code
Nov 06, 2024
Figure 1 for Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?
Figure 2 for Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?
Figure 3 for Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?
Figure 4 for Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?
Viaarxiv icon

Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey

Add code
Nov 06, 2024
Figure 1 for Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey
Figure 2 for Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey
Figure 3 for Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey
Figure 4 for Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey
Viaarxiv icon