Picture for Qibin Hou

Qibin Hou

DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation

Add code
Apr 07, 2025
Viaarxiv icon

Re-Aligning Language to Visual Objects with an Agentic Workflow

Add code
Mar 30, 2025
Viaarxiv icon

KAC: Kolmogorov-Arnold Classifier for Continual Learning

Add code
Mar 27, 2025
Viaarxiv icon

AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction

Add code
Mar 17, 2025
Viaarxiv icon

K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs

Add code
Feb 25, 2025
Viaarxiv icon

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Add code
Feb 10, 2025
Viaarxiv icon

LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding

Add code
Jan 09, 2025
Figure 1 for LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding
Figure 2 for LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding
Figure 3 for LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding
Figure 4 for LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding
Viaarxiv icon

Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection

Add code
Jan 08, 2025
Viaarxiv icon

SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection

Add code
Dec 30, 2024
Viaarxiv icon

TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction

Add code
Dec 22, 2024
Figure 1 for TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
Figure 2 for TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
Figure 3 for TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
Figure 4 for TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
Viaarxiv icon