Li Zhao

Senior Member, IEEE

COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection

Nov 28, 2024
Via arXiv

DanceFusion: A Spatio-Temporal Skeleton Diffusion Transformer for Audio-Driven Dance Motion Reconstruction

Nov 07, 2024
Via arXiv

Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL

Jul 20, 2024
Via arXiv

Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection

Jul 17, 2024
Via arXiv

Video In-context Learning

Jul 10, 2024
Via arXiv

DPO Meets PPO: Reinforced Token Optimization for RLHF

Apr 29, 2024
Via arXiv

Empowering Large Language Models on Robotic Manipulation with Affordance Prompting

Apr 17, 2024
Via arXiv

VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning

Jan 04, 2024
Via arXiv

Layer-Adapted Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition

Oct 06, 2023
Via arXiv

Pre-Trained Large Language Models for Industrial Control

Aug 06, 2023
Via arXiv