Sanghyuk Chun

Probabilistic Language-Image Pre-Training

Oct 24, 2024

Read, Watch and Scream! Sound Generation from Text and Video

Jul 08, 2024

Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval

Jun 13, 2024

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

Apr 26, 2024

Toward Interactive Regional Understanding in Vision-Large Language Models

Mar 27, 2024

Language-only Efficient Training of Zero-shot Composed Image Retrieval

Dec 04, 2023

Longer-range Contextualized Masked Autoencoder

Oct 20, 2023

Improved Probabilistic Image-Text Representations

May 29, 2023

RoCOCO: Robust Benchmark MS-COCO to Stress-test Robustness of Image-Text Matching Models

Apr 21, 2023

Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the Wild

Apr 10, 2023