Yicong Li

EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering

Feb 11, 2025
Via arXiv

Understanding Long Videos via LLM-Powered Entity Relation Graphs

Jan 27, 2025
Via arXiv

Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models

Nov 19, 2024
Via arXiv

Personality Analysis from Online Short Video Platforms with Multi-domain Adaptation

Oct 26, 2024
Via arXiv

Multimodal Learning for Embryo Viability Prediction in Clinical IVF

Oct 21, 2024
Via arXiv

MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality

Aug 17, 2024
Via arXiv

Research on Personalized Compression Algorithm for Pre-trained Models Based on Homomorphic Entropy Increase

Aug 16, 2024
Via arXiv

VideoQA in the Era of LLMs: An Empirical Study

Aug 08, 2024
Via arXiv

Toward Structure Fairness in Dynamic Graph Embedding: A Trend-aware Dual Debiasing Approach

Jun 19, 2024
Via arXiv

Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives

Jun 09, 2024
Via arXiv