Picture for Rui Huang

Rui Huang

College of Computer Science and Technology, Civil Aviation University of China, China

Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening

Add code
Feb 07, 2025
Viaarxiv icon

PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression

Add code
Feb 07, 2025
Viaarxiv icon

DEFOM-Stereo: Depth Foundation Model Based Stereo Matching

Add code
Jan 16, 2025
Viaarxiv icon

Fine-grained Video-Text Retrieval: A New Benchmark and Method

Add code
Dec 31, 2024
Viaarxiv icon

CRM: Retrieval Model with Controllable Condition

Add code
Dec 18, 2024
Viaarxiv icon

Grasp What You Want: Embodied Dexterous Grasping System Driven by Your Voice

Add code
Dec 14, 2024
Viaarxiv icon

GTPC-SSCD: Gate-guided Two-level Perturbation Consistency-based Semi-Supervised Change Detection

Add code
Nov 28, 2024
Viaarxiv icon

Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data

Add code
Nov 23, 2024
Figure 1 for Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Figure 2 for Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Figure 3 for Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Figure 4 for Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data
Viaarxiv icon

QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou

Add code
Nov 18, 2024
Figure 1 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Figure 2 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Figure 3 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Figure 4 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Viaarxiv icon

KuaiFormer: Transformer-Based Retrieval at Kuaishou

Add code
Nov 15, 2024
Viaarxiv icon