Bo He

UMono: Physical Model Informed Hybrid CNN-Transformer Framework for Underwater Monocular Depth Estimation

Jul 25, 2024
Via arXiv

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Apr 08, 2024
Via arXiv

OmniVid: A Generative Framework for Universal Video Understanding

Mar 26, 2024
Via arXiv

To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Nov 29, 2023
Via arXiv

Chop & Learn: Recognizing and Generating Object-State Compositions

Sep 25, 2023
Via arXiv

Towards Scalable Neural Representation for Diverse Videos

Mar 24, 2023
Via arXiv

Align and Attend: Multimodal Summarization with Dual Contrastive Losses

Mar 13, 2023
Via arXiv

CNeRV: Content-adaptive Neural Representation for Visual Data

Nov 18, 2022
Via arXiv

Learning Semantic Correspondence with Sparse Annotations

Aug 17, 2022
Via arXiv

ColdGuess: A General and Effective Relational Graph Convolutional Network to Tackle Cold Start Cases

May 26, 2022
Via arXiv