Picture for Yaosi Hu

Yaosi Hu

LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models

Add code
Feb 21, 2025
Viaarxiv icon

Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model

Add code
Feb 19, 2025
Viaarxiv icon

TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Add code
Dec 13, 2024
Figure 1 for TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation
Figure 2 for TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation
Figure 3 for TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation
Figure 4 for TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation
Viaarxiv icon

SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control

Add code
Mar 28, 2024
Figure 1 for SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
Figure 2 for SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
Figure 3 for SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
Figure 4 for SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control
Viaarxiv icon

LaMD: Latent Motion Diffusion for Video Generation

Add code
Apr 23, 2023
Viaarxiv icon

Learning Human Cognitive Appraisal Through Reinforcement Memory Unit

Add code
Aug 06, 2022
Figure 1 for Learning Human Cognitive Appraisal Through Reinforcement Memory Unit
Figure 2 for Learning Human Cognitive Appraisal Through Reinforcement Memory Unit
Figure 3 for Learning Human Cognitive Appraisal Through Reinforcement Memory Unit
Figure 4 for Learning Human Cognitive Appraisal Through Reinforcement Memory Unit
Viaarxiv icon

Make It Move: Controllable Image-to-Video Generation with Text Descriptions

Add code
Dec 06, 2021
Figure 1 for Make It Move: Controllable Image-to-Video Generation with Text Descriptions
Figure 2 for Make It Move: Controllable Image-to-Video Generation with Text Descriptions
Figure 3 for Make It Move: Controllable Image-to-Video Generation with Text Descriptions
Figure 4 for Make It Move: Controllable Image-to-Video Generation with Text Descriptions
Viaarxiv icon

Predicate correlation learning for scene graph generation

Add code
Jul 06, 2021
Figure 1 for Predicate correlation learning for scene graph generation
Figure 2 for Predicate correlation learning for scene graph generation
Figure 3 for Predicate correlation learning for scene graph generation
Figure 4 for Predicate correlation learning for scene graph generation
Viaarxiv icon