Picture for Hongchen Wei

Hongchen Wei

LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models

Add code
Feb 21, 2025
Viaarxiv icon

Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model

Add code
Feb 19, 2025
Viaarxiv icon

Visual Context Window Extension: A New Perspective for Long Video Understanding

Add code
Sep 30, 2024
Viaarxiv icon

Improving Generalization of Image Captioning with Unsupervised Prompt Learning

Add code
Aug 05, 2023
Viaarxiv icon

Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning

Add code
Oct 28, 2021
Figure 1 for Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning
Figure 2 for Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning
Figure 3 for Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning
Figure 4 for Exploiting Cross-Modal Prediction and Relation Consistency for Semi-Supervised Image Captioning
Viaarxiv icon