Picture for Bumsoo Kim

Bumsoo Kim

ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition

Add code
Dec 21, 2024
Viaarxiv icon

Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning

Add code
Nov 24, 2024
Figure 1 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Figure 2 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Figure 3 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Figure 4 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Viaarxiv icon

See It All: Contextualized Late Aggregation for 3D Dense Captioning

Add code
Aug 14, 2024
Figure 1 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Figure 2 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Figure 3 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Figure 4 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Viaarxiv icon

Bi-directional Contextual Attention for 3D Dense Captioning

Add code
Aug 13, 2024
Figure 1 for Bi-directional Contextual Attention for 3D Dense Captioning
Figure 2 for Bi-directional Contextual Attention for 3D Dense Captioning
Figure 3 for Bi-directional Contextual Attention for 3D Dense Captioning
Figure 4 for Bi-directional Contextual Attention for 3D Dense Captioning
Viaarxiv icon

Cartoon Hallucinations Detection: Pose-aware In Context Visual Learning

Add code
Mar 25, 2024
Viaarxiv icon

Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application

Add code
Feb 08, 2024
Viaarxiv icon

ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer

Add code
Feb 05, 2024
Figure 1 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Figure 2 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Figure 3 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Figure 4 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Viaarxiv icon

UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection

Add code
Dec 19, 2023
Viaarxiv icon

Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pretraining

Add code
Dec 19, 2023
Viaarxiv icon

Expediting Contrastive Language-Image Pretraining via Self-distilled Encoders

Add code
Dec 19, 2023
Viaarxiv icon