Meishan Zhang

Towards Text-Image Interleaved Retrieval

Feb 18, 2025

Semantic Role Labeling: A Systematical Survey

Feb 09, 2025

GME: Improving Universal Multimodal Retrieval by Multimodal LLMs

Dec 22, 2024

Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark

Dec 03, 2024

Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image

Oct 20, 2024

An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation

Aug 16, 2024

mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval

Jul 29, 2024

Enhancing Video-Language Representations with Structural Spatio-Temporal Alignment

Jun 27, 2024

LLM-Driven Multimodal Opinion Expression Identification

Jun 26, 2024

Retrieval-style In-Context Learning for Few-shot Hierarchical Text Classification

Jun 25, 2024