Wenmeng Yu

CogVLM2: Visual Language Models for Image and Video Understanding

Aug 29, 2024
Via arXiv

AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models

Jun 14, 2024
Via arXiv

CogAgent: A Visual Language Model for GUI Agents

Dec 21, 2023
Via arXiv

CogVLM: Visual Expert for Pretrained Language Models

Nov 06, 2023
Via arXiv

M-SENA: An Integrated Platform for Multimodal Sentiment Analysis

Mar 23, 2022
Via arXiv

Learning Modality-Specific Representations with Self-Supervised Multi-Task Learning for Multimodal Sentiment Analysis

Feb 09, 2021
Via arXiv