Shanshan Zhao

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

May 05, 2025, via arXiv

JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation

Mar 31, 2025, via arXiv

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation

Dec 25, 2024, via arXiv

Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation

Sep 04, 2024, via arXiv

Towards Modality-agnostic Label-efficient Segmentation with Entropy-Regularized Distribution Alignment

Aug 29, 2024, via arXiv

UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather

Apr 08, 2024, via arXiv

Local-consistent Transformation Learning for Rotation-invariant Point Cloud Analysis

Mar 17, 2024, via arXiv

When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability

Mar 01, 2024, via arXiv

ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding

Dec 18, 2023, via arXiv

Optical Quantum Sensing for Agnostic Environments via Deep Learning

Nov 13, 2023, via arXiv