Brian Chen

Personalized Video Summarization by Multimodal Video Understanding

Nov 05, 2024
Via arXiv

User-in-the-loop Evaluation of Multimodal LLMs for Activity Assistance

Aug 04, 2024
Via arXiv

EgoTV: Egocentric Task Verification from Natural Language Task Descriptions

Apr 17, 2023
Via arXiv

What, when, and where? -- Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions

Mar 29, 2023
Via arXiv

Interpretable Graph Convolutional Network of Multi-Modality Brain Imaging for Alzheimer's Disease Diagnosis

Apr 27, 2022
Via arXiv

Numerical and geometrical aspects of flow-based variational quantum Monte Carlo

Mar 28, 2022
Via arXiv

Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval

Dec 08, 2021
Via arXiv

PreViTS: Contrastive Pretraining with Video Tracking Supervision

Dec 01, 2021
Via arXiv

Routing with Self-Attention for Multimodal Capsule Networks

Dec 01, 2021
Via arXiv

Cascaded Multilingual Audio-Visual Learning from Videos

Nov 08, 2021
Via arXiv