Picture for Songhao Han

Songhao Han

Beihang University

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

Add code
Nov 22, 2024
Viaarxiv icon

Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation

Add code
Mar 09, 2024
Viaarxiv icon

LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions

Add code
Nov 20, 2023
Figure 1 for LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions
Figure 2 for LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions
Figure 3 for LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions
Figure 4 for LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions
Viaarxiv icon

VLSNR:Vision-Linguistics Coordination Time Sequence-aware News Recommendation

Add code
Oct 06, 2022
Figure 1 for VLSNR:Vision-Linguistics Coordination Time Sequence-aware News Recommendation
Figure 2 for VLSNR:Vision-Linguistics Coordination Time Sequence-aware News Recommendation
Figure 3 for VLSNR:Vision-Linguistics Coordination Time Sequence-aware News Recommendation
Figure 4 for VLSNR:Vision-Linguistics Coordination Time Sequence-aware News Recommendation
Viaarxiv icon