Picture for Bumsoo Kim

Bumsoo Kim

MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation

Add code
Feb 10, 2025
Viaarxiv icon

ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition

Add code
Dec 21, 2024
Figure 1 for ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition
Figure 2 for ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition
Figure 3 for ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition
Figure 4 for ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition
Viaarxiv icon

Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning

Add code
Nov 24, 2024
Figure 1 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Figure 2 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Figure 3 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Figure 4 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Viaarxiv icon

See It All: Contextualized Late Aggregation for 3D Dense Captioning

Add code
Aug 14, 2024
Figure 1 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Figure 2 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Figure 3 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Figure 4 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Viaarxiv icon

Bi-directional Contextual Attention for 3D Dense Captioning

Add code
Aug 13, 2024
Figure 1 for Bi-directional Contextual Attention for 3D Dense Captioning
Figure 2 for Bi-directional Contextual Attention for 3D Dense Captioning
Figure 3 for Bi-directional Contextual Attention for 3D Dense Captioning
Figure 4 for Bi-directional Contextual Attention for 3D Dense Captioning
Viaarxiv icon

Cartoon Hallucinations Detection: Pose-aware In Context Visual Learning

Add code
Mar 25, 2024
Viaarxiv icon

Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application

Add code
Feb 08, 2024
Viaarxiv icon

ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer

Add code
Feb 05, 2024
Figure 1 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Figure 2 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Figure 3 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Figure 4 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Viaarxiv icon

UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection

Add code
Dec 19, 2023
Viaarxiv icon

Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pretraining

Add code
Dec 19, 2023
Viaarxiv icon