Picture for Basura Fernando

Basura Fernando

FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data

Add code
Nov 22, 2024
Viaarxiv icon

Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios

Add code
Nov 20, 2024
Viaarxiv icon

Situational Scene Graph for Structured Human-centric Situation Understanding

Add code
Oct 30, 2024
Viaarxiv icon

Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework

Add code
Oct 14, 2024
Figure 1 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Figure 2 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Figure 3 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Figure 4 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Viaarxiv icon

Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos

Add code
Jul 30, 2024
Viaarxiv icon

CausalChaos! Dataset for Comprehensive Causal Action Question Answering Over Longer Causal Chains Grounded in Dynamic Visual Scenes

Add code
Apr 01, 2024
Viaarxiv icon

Zero Shot Open-ended Video Inference

Add code
Jan 23, 2024
Viaarxiv icon

Learning to Visually Connect Actions and their Effects

Add code
Jan 19, 2024
Viaarxiv icon

Motion Flow Matching for Human Motion Synthesis and Editing

Add code
Dec 14, 2023
Viaarxiv icon

Semi-supervised multimodal coreference resolution in image narrations

Add code
Oct 20, 2023
Viaarxiv icon