Picture for Sangho Lee

Sangho Lee

One Diffusion to Generate Them All

Add code
Nov 25, 2024
Viaarxiv icon

Finding NeMo: Negative-mined Mosaic Augmentation for Referring Image Segmentation

Add code
Nov 03, 2024
Viaarxiv icon

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Add code
Sep 25, 2024
Figure 1 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 2 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 3 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Figure 4 for Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Viaarxiv icon

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Add code
Dec 28, 2023
Viaarxiv icon

Integrated Path Tracking with DYC and MPC using LSTM Based Tire Force Estimator for Four-wheel Independent Steering and Driving Vehicle

Add code
Dec 13, 2023
Viaarxiv icon

Can Language Models Laugh at YouTube Short-form Videos?

Add code
Oct 26, 2023
Viaarxiv icon

X-CANIDS: Signal-Aware Explainable Intrusion Detection System for Controller Area Network-Based In-Vehicle Network

Add code
Mar 22, 2023
Viaarxiv icon

Boundary-aware Self-supervised Learning for Video Scene Segmentation

Add code
Jan 14, 2022
Figure 1 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Figure 2 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Figure 3 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Figure 4 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Viaarxiv icon

Unsupervised Representation Learning via Neural Activation Coding

Add code
Dec 07, 2021
Figure 1 for Unsupervised Representation Learning via Neural Activation Coding
Figure 2 for Unsupervised Representation Learning via Neural Activation Coding
Figure 3 for Unsupervised Representation Learning via Neural Activation Coding
Figure 4 for Unsupervised Representation Learning via Neural Activation Coding
Viaarxiv icon

Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning

Add code
Jan 26, 2021
Figure 1 for Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning
Figure 2 for Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning
Figure 3 for Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning
Figure 4 for Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning
Viaarxiv icon