Heming Du

MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset

Oct 25, 2024
Via arXiv

Diverse Sign Language Translation

Oct 25, 2024
Via arXiv

TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm

Sep 30, 2024
Via arXiv

Perceive, Reflect, and Plan: Designing LLM Agent for Goal-Directed City Navigation without Instructions

Aug 08, 2024
Via arXiv

Affective Behaviour Analysis via Integrating Multi-Modal Knowledge

Mar 16, 2024
Via arXiv

Divide and Ensemble: Progressively Learning for the Unknown

Oct 09, 2023
Via arXiv

When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation with Weak-and-Noisy Supervision

Sep 02, 2023
Via arXiv

RVD: A Handheld Device-Based Fundus Video Dataset for Retinal Vessel Segmentation

Jul 13, 2023
Via arXiv

SEFormer: Structure Embedding Transformer for 3D Object Detection

Sep 05, 2022
Via arXiv

VTNet: Visual Transformer Network for Object Goal Navigation

May 20, 2021
Via arXiv