Xiaoyi Bao

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Nov 13, 2024
Via arXiv

CoReS: Orchestrating the Dance of Reasoning and Segmentation

Apr 08, 2024
Via arXiv

DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation

Mar 11, 2024
Via arXiv

Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model

Dec 18, 2023
Via arXiv

Relevant Intrinsic Feature Enhancement Network for Few-Shot Semantic Segmentation

Dec 11, 2023
Via arXiv

Opinion Tree Parsing for Aspect-based Sentiment Analysis

Jun 15, 2023
Via arXiv

Interpreting Hierarchical Linguistic Interactions in DNNs

Jun 29, 2020
Via arXiv