Picture for Zaijing Li

Zaijing Li

Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

Add code
Feb 22, 2026
Viaarxiv icon

Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills

Add code
Jun 12, 2025
Figure 1 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 2 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 3 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 4 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Viaarxiv icon

Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts

Add code
Jun 12, 2025
Figure 1 for Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts
Figure 2 for Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts
Figure 3 for Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts
Figure 4 for Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts
Viaarxiv icon

Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy

Add code
Feb 27, 2025
Figure 1 for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy
Figure 2 for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy
Figure 3 for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy
Figure 4 for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy
Viaarxiv icon

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Add code
Aug 07, 2024
Viaarxiv icon

HCQA @ Ego4D EgoSchema Challenge 2024

Add code
Jun 22, 2024
Figure 1 for HCQA @ Ego4D EgoSchema Challenge 2024
Figure 2 for HCQA @ Ego4D EgoSchema Challenge 2024
Figure 3 for HCQA @ Ego4D EgoSchema Challenge 2024
Figure 4 for HCQA @ Ego4D EgoSchema Challenge 2024
Viaarxiv icon

ObjectNLQ @ Ego4D Episodic Memory Challenge 2024

Add code
Jun 22, 2024
Figure 1 for ObjectNLQ @ Ego4D Episodic Memory Challenge 2024
Figure 2 for ObjectNLQ @ Ego4D Episodic Memory Challenge 2024
Figure 3 for ObjectNLQ @ Ego4D Episodic Memory Challenge 2024
Figure 4 for ObjectNLQ @ Ego4D Episodic Memory Challenge 2024
Viaarxiv icon

Enhancing the Emotional Generation Capability of Large Language Models via Emotional Chain-of-Thought

Add code
Jan 12, 2024
Viaarxiv icon

UniSA: Unified Generative Framework for Sentiment Analysis

Add code
Sep 04, 2023
Figure 1 for UniSA: Unified Generative Framework for Sentiment Analysis
Figure 2 for UniSA: Unified Generative Framework for Sentiment Analysis
Figure 3 for UniSA: Unified Generative Framework for Sentiment Analysis
Figure 4 for UniSA: Unified Generative Framework for Sentiment Analysis
Viaarxiv icon

EmoCaps: Emotion Capsule based Model for Conversational Emotion Recognition

Add code
Mar 25, 2022
Figure 1 for EmoCaps: Emotion Capsule based Model for Conversational Emotion Recognition
Figure 2 for EmoCaps: Emotion Capsule based Model for Conversational Emotion Recognition
Figure 3 for EmoCaps: Emotion Capsule based Model for Conversational Emotion Recognition
Figure 4 for EmoCaps: Emotion Capsule based Model for Conversational Emotion Recognition
Viaarxiv icon