Bobo Li

Audio-Visual Intelligence in Large Foundation Models

May 05, 2026
Via arXiv

Taming Actor-Observer Asymmetry in Agents via Dialectical Alignment

Apr 21, 2026
Via arXiv

LASQ: A Low-resource Aspect-based Sentiment Quadruple Extraction Dataset

Apr 12, 2026
Via arXiv

Orthogonal Spatial-temporal Distributional Transfer for 4D Generation

Mar 05, 2026
Via arXiv

Synergizing Understanding and Generation with Interleaved Analyzing-Drafting Thinking

Feb 24, 2026
Via arXiv

Global Commander and Local Operative: A Dual-Agent Framework for Scene Navigation

Feb 21, 2026
Via arXiv

Unveiling the Cognitive Compass: Theory-of-Mind-Guided Multimodal Emotion Reasoning

Feb 01, 2026
Via arXiv

Event Extraction in Large Language Model

Dec 22, 2025
Via arXiv

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

Nov 11, 2025
Via arXiv

On Path to Multimodal Generalist: General-Level and General-Bench

May 07, 2025
Via arXiv