Picture for Peng Zhai

Peng Zhai

FysicsWorld: A Unified Full-Modality Benchmark for Any-to-Any Understanding, Generation, and Reasoning

Add code
Dec 14, 2025
Viaarxiv icon

Improving Multimodal Sentiment Analysis via Modality Optimization and Dynamic Primary Modality Selection

Add code
Nov 14, 2025
Viaarxiv icon

RENet: Fault-Tolerant Motion Control for Quadruped Robots via Redundant Estimator Networks under Visual Collapse

Add code
Sep 11, 2025
Viaarxiv icon

Music-Driven Legged Robots: Synchronized Walking to Rhythmic Beats

Add code
Mar 06, 2025
Viaarxiv icon

Continuous Control of Diverse Skills in Quadruped Robots Without Complete Expert Datasets

Add code
Mar 05, 2025
Viaarxiv icon

Role Play: Learning Adaptive Role-Specific Strategies in Multi-Agent Interactions

Add code
Nov 02, 2024
Viaarxiv icon

A Robust Quadruped Robot with Twisting Waist for Flexible Motions

Add code
Oct 08, 2024
Figure 1 for A Robust Quadruped Robot with Twisting Waist for Flexible Motions
Figure 2 for A Robust Quadruped Robot with Twisting Waist for Flexible Motions
Figure 3 for A Robust Quadruped Robot with Twisting Waist for Flexible Motions
Figure 4 for A Robust Quadruped Robot with Twisting Waist for Flexible Motions
Viaarxiv icon

Large Vision-Language Models as Emotion Recognizers in Context Awareness

Add code
Jul 16, 2024
Figure 1 for Large Vision-Language Models as Emotion Recognizers in Context Awareness
Figure 2 for Large Vision-Language Models as Emotion Recognizers in Context Awareness
Figure 3 for Large Vision-Language Models as Emotion Recognizers in Context Awareness
Figure 4 for Large Vision-Language Models as Emotion Recognizers in Context Awareness
Viaarxiv icon

Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations

Add code
Jul 06, 2024
Figure 1 for Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations
Figure 2 for Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations
Figure 3 for Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations
Figure 4 for Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations
Viaarxiv icon

PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications

Add code
May 29, 2024
Figure 1 for PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications
Figure 2 for PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications
Figure 3 for PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications
Figure 4 for PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications
Viaarxiv icon