Shan He

University of Birmingham

Learning to Evolve: A Self-Improving Framework for Multi-Agent Systems via Textual Parameter Graph Optimization

Apr 22, 2026
Via arXiv

SpaceMind: A Modular and Self-Evolving Embodied Vision-Language Agent Framework for Autonomous On-orbit Servicing

Apr 15, 2026
Via arXiv

Beyond Monologue: Interactive Talking-Listening Avatar Generation with Conversational Audio Context-Aware Kernels

Apr 11, 2026
Via arXiv

EARTalking: End-to-end GPT-style Autoregressive Talking Head Synthesis with Frame-wise Control

Mar 19, 2026
Via arXiv

REST: Diffusion-based Real-time End-to-end Streaming Talking Head Generation via ID-Context Caching and Asynchronous Streaming Distillation

Dec 12, 2025
Via arXiv

Improving Swimming Performance in Soft Robotic Fish with Distributed Muscles and Embedded Kinematic Sensing

Apr 15, 2025
Via arXiv

MC-GRU: a Multi-Channel GRU network for generalized nonlinear structural response prediction across structures

Mar 10, 2025
Via arXiv

A Real-time Spatio-Temporal Trajectory Planner for Autonomous Vehicles with Semantic Graph Optimization

Feb 25, 2025
Via arXiv

PoAct: Policy and Action Dual-Control Agent for Generalized Applications

Jan 13, 2025
Via arXiv

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion

Nov 23, 2024
Via arXiv