Picture for Jihwan Park

Jihwan Park

Bridging the Gap between Audio and Text using Parallel-attention for User-defined Keyword Spotting

Add code
Aug 07, 2024
Viaarxiv icon

Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble

Add code
Jul 27, 2024
Viaarxiv icon

Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection

Add code
Mar 26, 2024
Viaarxiv icon

Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models

Add code
Aug 18, 2023
Viaarxiv icon

Joint unsupervised and supervised learning for context-aware language identification

Add code
Apr 14, 2023
Viaarxiv icon

That's What I Said: Fully-Controllable Talking Face Generation

Add code
Apr 06, 2023
Viaarxiv icon

Metric Learning for User-defined Keyword Spotting

Add code
Nov 01, 2022
Viaarxiv icon

Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection

Add code
Apr 11, 2022
Figure 1 for Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection
Figure 2 for Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection
Figure 3 for Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection
Figure 4 for Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection
Viaarxiv icon

Deformable Graph Convolutional Networks

Add code
Dec 29, 2021
Figure 1 for Deformable Graph Convolutional Networks
Figure 2 for Deformable Graph Convolutional Networks
Figure 3 for Deformable Graph Convolutional Networks
Figure 4 for Deformable Graph Convolutional Networks
Viaarxiv icon