Sanyuan Zhao

RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

Jul 15, 2024

InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

Feb 05, 2024

TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision

Jun 06, 2023

Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for Autonomous Driving

Feb 08, 2023

Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering

Dec 14, 2021

Self-Learning with Rectification Strategy for Human Parsing

Apr 17, 2020

Improved Face Detection and Alignment using Cascade Deep Convolutional Network

Jul 28, 2017