
Song Park

MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation

Nov 28, 2024
Via arXiv

Probabilistic Language-Image Pre-Training

Oct 24, 2024
Via arXiv

Rotary Position Embedding for Vision Transformer

Mar 20, 2024
Via arXiv

Forging Tokens for Improved Storage-efficient Training

Dec 15, 2023
Via arXiv

SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage

Mar 20, 2023
Via arXiv

Similarity of Neural Architectures Based on Input Gradient Transferability

Oct 20, 2022
Via arXiv

ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO

Apr 14, 2022
Via arXiv

Few-shot Font Generation with Weakly Supervised Localized Representations

Dec 22, 2021
Via arXiv

StyleAugment: Learning Texture De-biased Representations by Style Augmentation without Pre-defined Textures

Aug 24, 2021
Via arXiv

Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts

Apr 02, 2021
Via arXiv