Picture for Xuri Ge

Xuri Ge

Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation

Add code
Nov 05, 2024
Figure 1 for Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation
Figure 2 for Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation
Figure 3 for Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation
Figure 4 for Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation
Viaarxiv icon

R^3AG: First Workshop on Refined and Reliable Retrieval Augmented Generation

Add code
Oct 27, 2024
Viaarxiv icon

HpEIS: Learning Hand Pose Embeddings for Multimedia Interactive Systems

Add code
Oct 11, 2024
Figure 1 for HpEIS: Learning Hand Pose Embeddings for Multimedia Interactive Systems
Figure 2 for HpEIS: Learning Hand Pose Embeddings for Multimedia Interactive Systems
Figure 3 for HpEIS: Learning Hand Pose Embeddings for Multimedia Interactive Systems
Figure 4 for HpEIS: Learning Hand Pose Embeddings for Multimedia Interactive Systems
Viaarxiv icon

Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning

Add code
Aug 01, 2024
Figure 1 for Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning
Figure 2 for Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning
Figure 3 for Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning
Figure 4 for Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning
Viaarxiv icon

Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition

Add code
May 26, 2024
Figure 1 for Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition
Figure 2 for Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition
Figure 3 for Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition
Figure 4 for Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition
Viaarxiv icon

3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting

Add code
Apr 26, 2024
Viaarxiv icon

IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT

Add code
Apr 11, 2024
Viaarxiv icon

Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries

Add code
Feb 28, 2024
Figure 1 for Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries
Figure 2 for Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries
Figure 3 for Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries
Figure 4 for Text2Pic Swift: Enhancing Long-Text to Image Retrieval for Large-Scale Libraries
Viaarxiv icon

The Relationship Between Speech Features Changes When You Get Depressed: Feature Correlations for Improving Speed and Performance of Depression Detection

Add code
Jul 07, 2023
Viaarxiv icon

Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval

Add code
Oct 17, 2022
Figure 1 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Figure 2 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Figure 3 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Figure 4 for Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval
Viaarxiv icon