Yuqing Song

Renmin University of China

Accommodating Audio Modality in CLIP for Multimodal Processing

Mar 12, 2023

Unifying Event Detection and Captioning as Sequence Generation via Pre-Training

Jul 18, 2022

Some theoretical results on discrete contour trees

Jun 24, 2022

Progressive Learning for Image Retrieval with Hybrid-Modality Queries

Apr 24, 2022

Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training

Aug 25, 2021

Team RUC_AIM3 Technical Report at ActivityNet 2021: Entities Object Localization

Jun 11, 2021

Towards Diverse Paragraph Captioning for Untrimmed Videos

May 30, 2021

WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training

Mar 19, 2021

Team RUC_AIM3 Technical Report at Activitynet 2020 Task 2: Exploring Sequential Events Detection for Dense Video Captioning

Jun 14, 2020

Integrating Temporal and Spatial Attentions for VATEX Video Captioning Challenge 2019

Oct 15, 2019