Picture for Tianrui Hui

Tianrui Hui

Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding

Add code
Sep 12, 2024
Figure 1 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 2 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 3 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 4 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Viaarxiv icon

Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object Segmentation

Add code
Aug 28, 2024
Viaarxiv icon

Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training

Add code
Dec 04, 2023
Viaarxiv icon

Enriching Phrases with Coupled Pixel and Object Contexts for Panoptic Narrative Grounding

Add code
Nov 02, 2023
Viaarxiv icon

Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline

Add code
Oct 06, 2022
Figure 1 for Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline
Figure 2 for Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline
Figure 3 for Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline
Figure 4 for Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline
Viaarxiv icon

PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding

Add code
Aug 11, 2022
Figure 1 for PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Figure 2 for PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Figure 3 for PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Figure 4 for PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding
Viaarxiv icon

Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation

Add code
Jun 08, 2022
Figure 1 for Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Figure 2 for Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Figure 3 for Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Figure 4 for Language-Bridged Spatial-Temporal Interaction for Referring Video Object Segmentation
Viaarxiv icon

A Keypoint-based Global Association Network for Lane Detection

Add code
Apr 15, 2022
Figure 1 for A Keypoint-based Global Association Network for Lane Detection
Figure 2 for A Keypoint-based Global Association Network for Lane Detection
Figure 3 for A Keypoint-based Global Association Network for Lane Detection
Figure 4 for A Keypoint-based Global Association Network for Lane Detection
Viaarxiv icon

TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding

Add code
Aug 11, 2021
Figure 1 for TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding
Figure 2 for TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding
Figure 3 for TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding
Figure 4 for TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding
Viaarxiv icon

Cross-Modal Progressive Comprehension for Referring Segmentation

Add code
May 15, 2021
Figure 1 for Cross-Modal Progressive Comprehension for Referring Segmentation
Figure 2 for Cross-Modal Progressive Comprehension for Referring Segmentation
Figure 3 for Cross-Modal Progressive Comprehension for Referring Segmentation
Figure 4 for Cross-Modal Progressive Comprehension for Referring Segmentation
Viaarxiv icon