Picture for Hang Ji

Hang Ji

MV-DETR: Multi-modality indoor object detection by Multi-View DEtecton TRansformers

Add code
Aug 13, 2024
Viaarxiv icon

LVIC: Multi-modality segmentation by Lifting Visual Info as Cue

Add code
Mar 08, 2024
Viaarxiv icon

PeP: a Point enhanced Painting method for unified point cloud tasks

Add code
Oct 11, 2023
Viaarxiv icon

HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasks

Add code
Aug 24, 2023
Figure 1 for HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasks
Figure 2 for HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasks
Figure 3 for HuBo-VLM: Unified Vision-Language Model designed for HUman roBOt interaction tasks
Viaarxiv icon

OG: Equip vision occupancy with instance segmentation and visual grounding

Add code
Jul 12, 2023
Viaarxiv icon

OVO: Open-Vocabulary Occupancy

Add code
May 25, 2023
Figure 1 for OVO: Open-Vocabulary Occupancy
Figure 2 for OVO: Open-Vocabulary Occupancy
Figure 3 for OVO: Open-Vocabulary Occupancy
Figure 4 for OVO: Open-Vocabulary Occupancy
Viaarxiv icon

Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models

Add code
Jun 24, 2022
Figure 1 for Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models
Figure 2 for Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models
Figure 3 for Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models
Figure 4 for Predicting within and across language phoneme recognition performance of self-supervised learning speech pre-trained models
Viaarxiv icon