Picture for Jian-Fang Hu

Jian-Fang Hu

ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations

Add code
Jan 24, 2025
Viaarxiv icon

SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection

Add code
Dec 17, 2024
Viaarxiv icon

TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching

Add code
Nov 26, 2024
Figure 1 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Figure 2 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Figure 3 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Figure 4 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Viaarxiv icon

SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses

Add code
Aug 07, 2024
Figure 1 for SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Figure 2 for SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Figure 3 for SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Figure 4 for SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
Viaarxiv icon

Progressive Pretext Task Learning for Human Trajectory Prediction

Add code
Jul 16, 2024
Viaarxiv icon

Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels

Add code
Mar 21, 2024
Viaarxiv icon

Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding

Add code
Mar 18, 2024
Viaarxiv icon

Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model

Add code
Mar 17, 2024
Figure 1 for Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
Figure 2 for Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
Figure 3 for Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
Figure 4 for Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model
Viaarxiv icon

STVGFormer: Spatio-Temporal Video Grounding with Static-Dynamic Cross-Modal Understanding

Add code
Jul 06, 2022
Figure 1 for STVGFormer: Spatio-Temporal Video Grounding with Static-Dynamic Cross-Modal Understanding
Figure 2 for STVGFormer: Spatio-Temporal Video Grounding with Static-Dynamic Cross-Modal Understanding
Figure 3 for STVGFormer: Spatio-Temporal Video Grounding with Static-Dynamic Cross-Modal Understanding
Viaarxiv icon

Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Video Grounding

Add code
Jun 20, 2021
Figure 1 for Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Video Grounding
Figure 2 for Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Video Grounding
Figure 3 for Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Video Grounding
Figure 4 for Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Video Grounding
Viaarxiv icon