Yibo Yan

SAFEERASER: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning

Feb 18, 2025
Via arXiv

EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models

Feb 17, 2025
Via arXiv

Position: LLMs Can be Good Tutors in Foreign Language Education

Feb 08, 2025
Via arXiv

Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning

Feb 05, 2025
Via arXiv

RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning

Feb 02, 2025
Via arXiv

A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges

Dec 16, 2024
Via arXiv

Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey

Dec 03, 2024
Via arXiv

Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks

Nov 27, 2024
Via arXiv

Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation

Nov 26, 2024
Via arXiv

SAVEn-Vid: Synergistic Audio-Visual Integration for Enhanced Understanding in Long Video Context

Nov 25, 2024
Via arXiv