Picture for Xiaoyong Wei

Xiaoyong Wei

Sichuan University, Hong Kong Polytechnic Univeristy

Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval

Add code
Jul 23, 2024
Figure 1 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Figure 2 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Figure 3 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Figure 4 for Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
Viaarxiv icon

SE Territory: Monaural Speech Enhancement Meets the Fixed Virtual Perceptual Space Mapping

Add code
Nov 03, 2023
Viaarxiv icon

UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning

Add code
Jun 01, 2023
Viaarxiv icon

Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval

Add code
Jun 17, 2022
Figure 1 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Figure 2 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Figure 3 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Figure 4 for Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Viaarxiv icon

Indicative Image Retrieval: Turning Blackbox Learning into Grey

Add code
Jan 28, 2022
Figure 1 for Indicative Image Retrieval: Turning Blackbox Learning into Grey
Figure 2 for Indicative Image Retrieval: Turning Blackbox Learning into Grey
Figure 3 for Indicative Image Retrieval: Turning Blackbox Learning into Grey
Figure 4 for Indicative Image Retrieval: Turning Blackbox Learning into Grey
Viaarxiv icon

Deep learning-based person re-identification methods: A survey and outlook of recent works

Add code
Oct 16, 2021
Figure 1 for Deep learning-based person re-identification methods: A survey and outlook of recent works
Figure 2 for Deep learning-based person re-identification methods: A survey and outlook of recent works
Figure 3 for Deep learning-based person re-identification methods: A survey and outlook of recent works
Figure 4 for Deep learning-based person re-identification methods: A survey and outlook of recent works
Viaarxiv icon

Global-Local Dynamic Feature Alignment Network for Person Re-Identification

Add code
Sep 13, 2021
Figure 1 for Global-Local Dynamic Feature Alignment Network for Person Re-Identification
Figure 2 for Global-Local Dynamic Feature Alignment Network for Person Re-Identification
Figure 3 for Global-Local Dynamic Feature Alignment Network for Person Re-Identification
Figure 4 for Global-Local Dynamic Feature Alignment Network for Person Re-Identification
Viaarxiv icon

M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks

Add code
Sep 09, 2021
Figure 1 for M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks
Figure 2 for M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks
Figure 3 for M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks
Figure 4 for M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks
Viaarxiv icon

ParNet: Position-aware Aggregated Relation Network for Image-Text matching

Add code
Jun 17, 2019
Figure 1 for ParNet: Position-aware Aggregated Relation Network for Image-Text matching
Figure 2 for ParNet: Position-aware Aggregated Relation Network for Image-Text matching
Figure 3 for ParNet: Position-aware Aggregated Relation Network for Image-Text matching
Figure 4 for ParNet: Position-aware Aggregated Relation Network for Image-Text matching
Viaarxiv icon