Picture for Jinyoung Moon

Jinyoung Moon

HiCM$^2$: Hierarchical Compact Memory Modeling for Dense Video Captioning

Add code
Dec 19, 2024
Viaarxiv icon

Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval

Add code
Apr 11, 2024
Figure 1 for Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
Figure 2 for Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
Figure 3 for Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
Figure 4 for Do You Remember? Dense Video Captioning with Cross-Modal Memory Retrieval
Viaarxiv icon

Adaptive Self-training Framework for Fine-grained Scene Graph Generation

Add code
Jan 18, 2024
Viaarxiv icon

LLM4SGG: Large Language Model for Weakly Supervised Scene Graph Generation

Add code
Oct 20, 2023
Viaarxiv icon

Unbiased Heterogeneous Scene Graph Generation with Relation-aware Message Passing Neural Network

Add code
Dec 01, 2022
Viaarxiv icon

Information Elevation Network for Fast Online Action Detection

Add code
Sep 28, 2021
Figure 1 for Information Elevation Network for Fast Online Action Detection
Figure 2 for Information Elevation Network for Fast Online Action Detection
Figure 3 for Information Elevation Network for Fast Online Action Detection
Figure 4 for Information Elevation Network for Fast Online Action Detection
Viaarxiv icon

Learning to Discriminate Information for Online Action Detection: Analysis and Application

Add code
Sep 09, 2021
Figure 1 for Learning to Discriminate Information for Online Action Detection: Analysis and Application
Figure 2 for Learning to Discriminate Information for Online Action Detection: Analysis and Application
Figure 3 for Learning to Discriminate Information for Online Action Detection: Analysis and Application
Figure 4 for Learning to Discriminate Information for Online Action Detection: Analysis and Application
Viaarxiv icon

Learning to Combine the Modalities of Language and Video for Temporal Moment Localization

Add code
Sep 07, 2021
Figure 1 for Learning to Combine the Modalities of Language and Video for Temporal Moment Localization
Figure 2 for Learning to Combine the Modalities of Language and Video for Temporal Moment Localization
Figure 3 for Learning to Combine the Modalities of Language and Video for Temporal Moment Localization
Figure 4 for Learning to Combine the Modalities of Language and Video for Temporal Moment Localization
Viaarxiv icon

Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting

Add code
May 25, 2021
Figure 1 for Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting
Figure 2 for Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting
Figure 3 for Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting
Figure 4 for Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting
Viaarxiv icon

Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video Classification

Add code
Dec 28, 2020
Figure 1 for Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video Classification
Figure 2 for Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video Classification
Figure 3 for Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video Classification
Figure 4 for Diverse Temporal Aggregation and Depthwise Spatiotemporal Factorization for Efficient Video Classification
Viaarxiv icon