Picture for Rui Huang

Rui Huang

College of Computer Science and Technology, Civil Aviation University of China, China

DEFOM-Stereo: Depth Foundation Model Based Stereo Matching

Add code
Jan 16, 2025
Viaarxiv icon

Fine-grained Video-Text Retrieval: A New Benchmark and Method

Add code
Dec 31, 2024
Viaarxiv icon

CRM: Retrieval Model with Controllable Condition

Add code
Dec 18, 2024
Viaarxiv icon

Grasp What You Want: Embodied Dexterous Grasping System Driven by Your Voice

Add code
Dec 14, 2024
Viaarxiv icon

GTPC-SSCD: Gate-guided Two-level Perturbation Consistency-based Semi-Supervised Change Detection

Add code
Nov 28, 2024
Viaarxiv icon

Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data

Add code
Nov 23, 2024
Figure 1 for Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Figure 2 for Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Figure 3 for Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Figure 4 for Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Viaarxiv icon

QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou

Add code
Nov 18, 2024
Figure 1 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Figure 2 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Figure 3 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Figure 4 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Viaarxiv icon

KuaiFormer: Transformer-Based Retrieval at Kuaishou

Add code
Nov 15, 2024
Viaarxiv icon

CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature Propagation

Add code
Nov 07, 2024
Figure 1 for CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature Propagation
Figure 2 for CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature Propagation
Figure 3 for CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature Propagation
Figure 4 for CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature Propagation
Viaarxiv icon

A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model

Add code
Nov 07, 2024
Viaarxiv icon