Humphrey Shi

CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting

Dec 26, 2024
Via arXiv

CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices

Dec 17, 2024
Via arXiv

OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation

Dec 12, 2024
Via arXiv

GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models

Aug 29, 2024
Via arXiv

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Aug 28, 2024
Via arXiv

Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation

Aug 01, 2024
Via arXiv

Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis

Jun 06, 2024
Via arXiv

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment

Jun 06, 2024
Via arXiv

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

May 09, 2024
Via arXiv

UVMap-ID: A Controllable and Personalized UV Map Generative Model

Apr 22, 2024
Via arXiv