Picture for Seong-Whan Lee

Seong-Whan Lee

SUGAR: Leveraging Contextual Confidence for Smarter Retrieval

Add code
Jan 09, 2025
Viaarxiv icon

Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation

Add code
Jan 09, 2025
Figure 1 for Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation
Figure 2 for Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation
Figure 3 for Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation
Figure 4 for Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation
Viaarxiv icon

JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis

Add code
Jan 09, 2025
Viaarxiv icon

FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching

Add code
Jan 09, 2025
Viaarxiv icon

Towards Personalized Brain-Computer Interface Application Based on Endogenous EEG Paradigms

Add code
Nov 18, 2024
Figure 1 for Towards Personalized Brain-Computer Interface Application Based on Endogenous EEG Paradigms
Figure 2 for Towards Personalized Brain-Computer Interface Application Based on Endogenous EEG Paradigms
Figure 3 for Towards Personalized Brain-Computer Interface Application Based on Endogenous EEG Paradigms
Viaarxiv icon

Dynamic Neural Communication: Convergence of Computer Vision and Brain-Computer Interface

Add code
Nov 14, 2024
Figure 1 for Dynamic Neural Communication: Convergence of Computer Vision and Brain-Computer Interface
Figure 2 for Dynamic Neural Communication: Convergence of Computer Vision and Brain-Computer Interface
Figure 3 for Dynamic Neural Communication: Convergence of Computer Vision and Brain-Computer Interface
Viaarxiv icon

Enhancing Multimodal Query Representation via Visual Dialogues for End-to-End Knowledge Retrieval

Add code
Nov 13, 2024
Viaarxiv icon

EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector

Add code
Nov 04, 2024
Figure 1 for EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector
Figure 2 for EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector
Figure 3 for EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector
Figure 4 for EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector
Viaarxiv icon

EEG-based Multimodal Representation Learning for Emotion Recognition

Add code
Oct 29, 2024
Viaarxiv icon

Accelerating High-Fidelity Waveform Generation via Adversarial Flow Matching Optimization

Add code
Aug 15, 2024
Viaarxiv icon