Ji-Hoon Kim

Accelerating Codec-based Speech Synthesis with Multi-Token Prediction and Speculative Decoding

Oct 17, 2024
Via arXiv

Text-To-Speech Synthesis In The Wild

Sep 13, 2024
Via arXiv

VoxSim: A perceptual voice similarity dataset

Jul 26, 2024
Via arXiv

FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching

Jun 13, 2024
Via arXiv

Faces that Speak: Jointly Synthesising Talking Face and Speech from Text

May 16, 2024
Via arXiv

FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder

Jan 18, 2024
Via arXiv

Let There Be Sound: Reconstructing High Quality Speech from Silent Videos

Aug 29, 2023
Via arXiv

CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis

Feb 28, 2023
Via arXiv

Relation-aware Language-Graph Transformer for Question Answering

Dec 02, 2022
Via arXiv

Accelerating Large-Scale Graph-based Nearest Neighbor Search on a Computational Storage Platform

Jul 12, 2022
Via arXiv