Picture for Jianzong Wang

Jianzong Wang

Confusion-Aware In-Context-Learning for Vision-Language Models in Robotic Manipulation

Add code
Mar 16, 2026
Viaarxiv icon

Vista: Scene-Aware Optimization for Streaming Video Question Answering under Post-Hoc Queries

Add code
Feb 09, 2026
Viaarxiv icon

Attention-weighted Centered Kernel Alignment for Knowledge Distillation in Large Audio-Language Models Applied to Speech Emotion Recognition

Add code
Feb 02, 2026
Viaarxiv icon

From Knowing to Doing Precisely: A General Self-Correction and Termination Framework for VLA models

Add code
Feb 02, 2026
Viaarxiv icon

MiTa: A Hierarchical Multi-Agent Collaboration Framework with Memory-integrated and Task Allocation

Add code
Jan 30, 2026
Viaarxiv icon

CARE: Multi-Task Pretraining for Latent Continuous Action Representation in Robot Control

Add code
Jan 30, 2026
Viaarxiv icon

MIRRORTALK: Forging Personalized Avatars Via Disentangled Style and Hierarchical Motion Control

Add code
Jan 30, 2026
Viaarxiv icon

Triage: Hierarchical Visual Budgeting for Efficient Video Reasoning in Vision-Language Models

Add code
Jan 30, 2026
Viaarxiv icon

EMO-RL: Emotion-Rule-Based Reinforcement Learning Enhanced Audio-Language Model for Generalized Speech Emotion Recognition

Add code
Sep 19, 2025
Viaarxiv icon

MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts

Add code
Jun 09, 2025
Viaarxiv icon