Bo He

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Apr 08, 2024

OmniVid: A Generative Framework for Universal Video Understanding

Mar 26, 2024

To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning

Nov 29, 2023

Chop & Learn: Recognizing and Generating Object-State Compositions

Sep 25, 2023

Towards Scalable Neural Representation for Diverse Videos

Mar 24, 2023

Align and Attend: Multimodal Summarization with Dual Contrastive Losses

Mar 13, 2023

CNeRV: Content-adaptive Neural Representation for Visual Data

Nov 18, 2022

Learning Semantic Correspondence with Sparse Annotations

Aug 17, 2022

ColdGuess: A General and Effective Relational Graph Convolutional Network to Tackle Cold Start Cases

May 26, 2022

ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization

Mar 29, 2022