Picture for Basura Fernando

Basura Fernando

Neuro Symbolic Knowledge Reasoning for Procedural Video Question Answering

Add code
Mar 19, 2025
Viaarxiv icon

Learning to Generate Long-term Future Narrations Describing Activities of Daily Living

Add code
Mar 03, 2025
Viaarxiv icon

PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning

Add code
Feb 17, 2025
Viaarxiv icon

FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data

Add code
Nov 22, 2024
Figure 1 for FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data
Figure 2 for FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data
Figure 3 for FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data
Figure 4 for FedMLLM: Federated Fine-tuning MLLM on Multimodal Heterogeneity Data
Viaarxiv icon

Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios

Add code
Nov 20, 2024
Viaarxiv icon

Situational Scene Graph for Structured Human-centric Situation Understanding

Add code
Oct 30, 2024
Viaarxiv icon

Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework

Add code
Oct 14, 2024
Figure 1 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Figure 2 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Figure 3 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Figure 4 for Cross-Modal Few-Shot Learning: a Generative Transfer Learning Framework
Viaarxiv icon

Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos

Add code
Jul 30, 2024
Figure 1 for Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos
Figure 2 for Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos
Figure 3 for Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos
Figure 4 for Effectively Leveraging CLIP for Generating Situational Summaries of Images and Videos
Viaarxiv icon

CausalChaos! Dataset for Comprehensive Causal Action Question Answering Over Longer Causal Chains Grounded in Dynamic Visual Scenes

Add code
Apr 01, 2024
Viaarxiv icon

Zero Shot Open-ended Video Inference

Add code
Jan 23, 2024
Viaarxiv icon