Qibin Hou

LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding

Jan 09, 2025

Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection

Jan 08, 2025

SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection

Dec 30, 2024

TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction

Dec 22, 2024

MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation

Dec 16, 2024

DenseVLM: A Retrieval and Decoupled Alignment Framework for Open-Vocabulary Dense Prediction

Dec 09, 2024

ClearSR: Latent Low-Resolution Image Embeddings Help Diffusion-Based Real-World Super Resolution Models See Clearer

Oct 18, 2024

OPUS: Occupancy Prediction Using a Sparse Set

Sep 14, 2024

Towards Stable 3D Object Detection

Jul 05, 2024

Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Jun 02, 2024