Picture for Kun Yan

Kun Yan

Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding

Add code
Jul 11, 2024
Viaarxiv icon

G-VOILA: Gaze-Facilitated Information Querying in Daily Scenarios

Add code
May 13, 2024
Viaarxiv icon

KU-DMIS-MSRA at RadSum23: Pre-trained Vision-Language Model for Radiology Report Summarization

Add code
Jul 10, 2023
Viaarxiv icon

GroundNLQ @ Ego4D Natural Language Queries Challenge 2023

Add code
Jun 27, 2023
Viaarxiv icon

Two-shot Video Object Segmentation

Add code
Mar 21, 2023
Viaarxiv icon

An Efficient COarse-to-fiNE Alignment Framework @ Ego4D Natural Language Queries Challenge 2022

Add code
Nov 16, 2022
Viaarxiv icon

HORIZON: A High-Resolution Panorama Synthesis Framework

Add code
Oct 10, 2022
Figure 1 for HORIZON: A High-Resolution Panorama Synthesis Framework
Figure 2 for HORIZON: A High-Resolution Panorama Synthesis Framework
Figure 3 for HORIZON: A High-Resolution Panorama Synthesis Framework
Figure 4 for HORIZON: A High-Resolution Panorama Synthesis Framework
Viaarxiv icon

CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding

Add code
Sep 22, 2022
Figure 1 for CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
Figure 2 for CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
Figure 3 for CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
Figure 4 for CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
Viaarxiv icon

Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention

Add code
Dec 07, 2021
Figure 1 for Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention
Figure 2 for Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention
Figure 3 for Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention
Figure 4 for Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention
Viaarxiv icon

CETransformer: Casual Effect Estimation via Transformer Based Representation Learning

Add code
Jul 19, 2021
Figure 1 for CETransformer: Casual Effect Estimation via Transformer Based Representation Learning
Figure 2 for CETransformer: Casual Effect Estimation via Transformer Based Representation Learning
Figure 3 for CETransformer: Casual Effect Estimation via Transformer Based Representation Learning
Figure 4 for CETransformer: Casual Effect Estimation via Transformer Based Representation Learning
Viaarxiv icon