Penglei Sun

Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective

Oct 14, 2024

3D Question Answering for City Scene Understanding

Jul 24, 2024

Multi-Task Domain Adaptation for Language Grounding with 3D Objects

Jul 03, 2024

A Contrastive Compositional Benchmark for Text-to-Image Synthesis: A Study with Unified Text-to-Image Fidelity Metrics

Dec 04, 2023

Learning 6-DoF Fine-grained Grasp Detection Based on Part Affordance Grounding

Jan 27, 2023

Human-in-the-loop Robotic Grasping using BERT Scene Representation

Sep 28, 2022

Multi-Modal Knowledge Graph Construction and Application: A Survey

Feb 11, 2022