Yibo Yan

Aligning Multimodal LLM with Human Preference: A Survey

Mar 18, 2025

LLM Agents for Education: Advances and Applications

Mar 14, 2025

SAFEERASER: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning

Feb 18, 2025

EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models

Feb 17, 2025

Position: LLMs Can be Good Tutors in Foreign Language Education

Feb 08, 2025

Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning

Feb 05, 2025

RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning

Feb 02, 2025

A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges

Dec 16, 2024

Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey

Dec 03, 2024

Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks

Nov 27, 2024