Picture for Sanghyuk Chun

Sanghyuk Chun

LongProLIP: A Probabilistic Vision-Language Model with Long Context Text

Add code
Mar 11, 2025
Viaarxiv icon

DNNs May Determine Major Properties of Their Outputs Early, with Timing Possibly Driven by Bias

Add code
Feb 12, 2025
Viaarxiv icon

Probabilistic Language-Image Pre-Training

Add code
Oct 24, 2024
Figure 1 for Probabilistic Language-Image Pre-Training
Figure 2 for Probabilistic Language-Image Pre-Training
Figure 3 for Probabilistic Language-Image Pre-Training
Figure 4 for Probabilistic Language-Image Pre-Training
Viaarxiv icon

Read, Watch and Scream! Sound Generation from Text and Video

Add code
Jul 08, 2024
Figure 1 for Read, Watch and Scream! Sound Generation from Text and Video
Figure 2 for Read, Watch and Scream! Sound Generation from Text and Video
Figure 3 for Read, Watch and Scream! Sound Generation from Text and Video
Figure 4 for Read, Watch and Scream! Sound Generation from Text and Video
Viaarxiv icon

Reducing Task Discrepancy of Text Encoders for Zero-Shot Composed Image Retrieval

Add code
Jun 13, 2024
Viaarxiv icon

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts

Add code
Apr 26, 2024
Viaarxiv icon

Toward Interactive Regional Understanding in Vision-Large Language Models

Add code
Mar 27, 2024
Viaarxiv icon

Language-only Efficient Training of Zero-shot Composed Image Retrieval

Add code
Dec 04, 2023
Viaarxiv icon

Longer-range Contextualized Masked Autoencoder

Add code
Oct 20, 2023
Viaarxiv icon

Improved Probabilistic Image-Text Representations

Add code
May 29, 2023
Viaarxiv icon