Picture for Hong Cai

Hong Cai

Distilling Multi-modal Large Language Models for Autonomous Driving

Add code
Jan 16, 2025
Viaarxiv icon

Planar Gaussian Splatting

Add code
Dec 02, 2024
Viaarxiv icon

PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer

Add code
Jul 16, 2024
Figure 1 for PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer
Figure 2 for PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer
Figure 3 for PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer
Figure 4 for PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer
Viaarxiv icon

ToSA: Token Selective Attention for Efficient Vision Transformers

Add code
Jun 13, 2024
Viaarxiv icon

SciFlow: Empowering Lightweight Optical Flow Models with Self-Cleaning Iterations

Add code
Apr 11, 2024
Figure 1 for SciFlow: Empowering Lightweight Optical Flow Models with Self-Cleaning Iterations
Figure 2 for SciFlow: Empowering Lightweight Optical Flow Models with Self-Cleaning Iterations
Figure 3 for SciFlow: Empowering Lightweight Optical Flow Models with Self-Cleaning Iterations
Figure 4 for SciFlow: Empowering Lightweight Optical Flow Models with Self-Cleaning Iterations
Viaarxiv icon

OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation

Add code
Mar 26, 2024
Figure 1 for OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation
Figure 2 for OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation
Figure 3 for OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation
Figure 4 for OCAI: Improving Optical Flow Estimation by Occlusion and Consistency Aware Interpolation
Viaarxiv icon

FutureDepth: Learning to Predict the Future Improves Video Depth Estimation

Add code
Mar 19, 2024
Figure 1 for FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
Figure 2 for FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
Figure 3 for FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
Figure 4 for FutureDepth: Learning to Predict the Future Improves Video Depth Estimation
Viaarxiv icon

DeCoTR: Enhancing Depth Completion with 2D and 3D Attentions

Add code
Mar 18, 2024
Viaarxiv icon

Neural Mesh Fusion: Unsupervised 3D Planar Surface Understanding

Add code
Feb 26, 2024
Viaarxiv icon

HexaGen3D: StableDiffusion is just one step away from Fast and Diverse Text-to-3D Generation

Add code
Jan 15, 2024
Viaarxiv icon