Picture for Pai Peng

Pai Peng

School of Mathematics and Computer Science, Jianghan University

You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving

Add code
Feb 28, 2025
Viaarxiv icon

Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving

Add code
Jan 15, 2025
Viaarxiv icon

Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification

Add code
Dec 28, 2024
Figure 1 for Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification
Figure 2 for Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification
Figure 3 for Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification
Figure 4 for Cross-Modal Mapping: Eliminating the Modality Gap for Few-Shot Image Classification
Viaarxiv icon

Light-weight End-to-End Graph Interest Network for CTR Prediction in E-commerce Search

Add code
Jun 25, 2024
Viaarxiv icon

Batch-in-Batch: a new adversarial training framework for initial perturbation and sample selection

Add code
Jun 06, 2024
Viaarxiv icon

Open-Vocabulary Object Detection via Neighboring Region Attention Alignment

Add code
May 14, 2024
Viaarxiv icon

MWSIS: Multimodal Weakly Supervised Instance Segmentation with 2D Box Annotations for Autonomous Driving

Add code
Dec 17, 2023
Viaarxiv icon

FECANet: Boosting Few-Shot Semantic Segmentation with Feature-Enhanced Context-Aware Network

Add code
Jan 19, 2023
Viaarxiv icon

Locate before Answering: Answer Guided Question Localization for Video Question Answering

Add code
Oct 05, 2022
Figure 1 for Locate before Answering: Answer Guided Question Localization for Video Question Answering
Figure 2 for Locate before Answering: Answer Guided Question Localization for Video Question Answering
Figure 3 for Locate before Answering: Answer Guided Question Localization for Video Question Answering
Figure 4 for Locate before Answering: Answer Guided Question Localization for Video Question Answering
Viaarxiv icon

An Efficient End-to-End Transformer with Progressive Tri-modal Attention for Multi-modal Emotion Recognition

Add code
Sep 20, 2022
Figure 1 for An Efficient End-to-End Transformer with Progressive Tri-modal Attention for Multi-modal Emotion Recognition
Figure 2 for An Efficient End-to-End Transformer with Progressive Tri-modal Attention for Multi-modal Emotion Recognition
Figure 3 for An Efficient End-to-End Transformer with Progressive Tri-modal Attention for Multi-modal Emotion Recognition
Figure 4 for An Efficient End-to-End Transformer with Progressive Tri-modal Attention for Multi-modal Emotion Recognition
Viaarxiv icon