Jiahao Huo

CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding

Jan 29, 2026
Via arXiv

Memory in the Age of AI Agents

Dec 15, 2025
Via arXiv

Improving Wildlife Out-of-Distribution Detection: Africa's Big Five

Jun 07, 2025
Via arXiv

MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models

May 28, 2025
Via arXiv

Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis

May 21, 2025
Via arXiv

EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models

Feb 17, 2025
Via arXiv

Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning

Feb 05, 2025
Via arXiv

Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey

Dec 03, 2024
Via arXiv

MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models

Oct 07, 2024
Via arXiv

ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection

Oct 06, 2024
Via arXiv