Picture for Seong Tae Kim

Seong Tae Kim

HiCM$^2$: Hierarchical Compact Memory Modeling for Dense Video Captioning

Add code
Dec 19, 2024
Figure 1 for HiCM$^2$: Hierarchical Compact Memory Modeling for Dense Video Captioning
Figure 2 for HiCM$^2$: Hierarchical Compact Memory Modeling for Dense Video Captioning
Figure 3 for HiCM$^2$: Hierarchical Compact Memory Modeling for Dense Video Captioning
Figure 4 for HiCM$^2$: Hierarchical Compact Memory Modeling for Dense Video Captioning
Viaarxiv icon

Resource-Efficient Medical Report Generation using Large Language Models

Add code
Oct 21, 2024
Viaarxiv icon

LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies

Add code
Oct 07, 2024
Viaarxiv icon

Retrieval-Augmented Natural Language Reasoning for Explainable Visual Question Answering

Add code
Aug 30, 2024
Viaarxiv icon

MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection

Add code
Jul 23, 2024
Figure 1 for MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
Figure 2 for MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
Figure 3 for MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
Figure 4 for MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
Viaarxiv icon

Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain

Add code
Jul 16, 2024
Figure 1 for Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain
Figure 2 for Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain
Figure 3 for Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain
Figure 4 for Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical Domain
Viaarxiv icon

Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval

Add code
Apr 11, 2024
Figure 1 for Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
Figure 2 for Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
Figure 3 for Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
Figure 4 for Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
Viaarxiv icon

WWW: A Unified Framework for Explaining What, Where and Why of Neural Networks by Interpretation of Neuron Concepts

Add code
Feb 29, 2024
Viaarxiv icon

OnDev-LCT: On-Device Lightweight Convolutional Transformers towards federated learning

Add code
Jan 22, 2024
Viaarxiv icon

One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era

Add code
Apr 04, 2023
Viaarxiv icon