Picture for Qi Jia

Qi Jia

UniDial-EvalKit: A Unified Toolkit for Evaluating Multi-Faceted Conversational Abilities

Add code
Mar 24, 2026
Viaarxiv icon

CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels

Add code
Mar 22, 2026
Viaarxiv icon

SafeSci: Safety Evaluation of Large Language Models in Science Domains and Beyond

Add code
Mar 02, 2026
Viaarxiv icon

VideoAesBench: Benchmarking the Video Aesthetics Perception Capabilities of Large Multimodal Models

Add code
Jan 29, 2026
Viaarxiv icon

Automated Safety Benchmarking: A Multi-agent Pipeline for LVLMs

Add code
Jan 27, 2026
Viaarxiv icon

Q-Bench-Portrait: Benchmarking Multimodal Large Language Models on Portrait Image Quality Perception

Add code
Jan 26, 2026
Viaarxiv icon

RSOD: Reliability-Guided Sonar Image Object Detection with Extremely Limited Labels

Add code
Jan 19, 2026
Viaarxiv icon

KidVis: Do Multimodal Large Language Models Possess the Visual Perceptual Capabilities of a 6-Year-Old?

Add code
Jan 13, 2026
Viaarxiv icon

EvolMem: A Cognitive-Driven Benchmark for Multi-Session Dialogue Memory

Add code
Jan 07, 2026
Viaarxiv icon

Generating Storytelling Images with Rich Chains-of-Reasoning

Add code
Dec 08, 2025
Viaarxiv icon