Picture for Bumsoo Kim

Bumsoo Kim

Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning

Add code
Nov 24, 2024
Figure 1 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Figure 2 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Figure 3 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Figure 4 for Is 'Right' Right? Enhancing Object Orientation Understanding in Multimodal Language Models through Egocentric Instruction Tuning
Viaarxiv icon

See It All: Contextualized Late Aggregation for 3D Dense Captioning

Add code
Aug 14, 2024
Figure 1 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Figure 2 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Figure 3 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Figure 4 for See It All: Contextualized Late Aggregation for 3D Dense Captioning
Viaarxiv icon

Bi-directional Contextual Attention for 3D Dense Captioning

Add code
Aug 13, 2024
Figure 1 for Bi-directional Contextual Attention for 3D Dense Captioning
Figure 2 for Bi-directional Contextual Attention for 3D Dense Captioning
Figure 3 for Bi-directional Contextual Attention for 3D Dense Captioning
Figure 4 for Bi-directional Contextual Attention for 3D Dense Captioning
Viaarxiv icon

Cartoon Hallucinations Detection: Pose-aware In Context Visual Learning

Add code
Mar 25, 2024
Viaarxiv icon

Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application

Add code
Feb 08, 2024
Viaarxiv icon

ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer

Add code
Feb 05, 2024
Figure 1 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Figure 2 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Figure 3 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Figure 4 for ToonAging: Face Re-Aging upon Artistic Portrait Style Transfer
Viaarxiv icon

Expediting Contrastive Language-Image Pretraining via Self-distilled Encoders

Add code
Dec 19, 2023
Viaarxiv icon

Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pretraining

Add code
Dec 19, 2023
Viaarxiv icon

UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection

Add code
Dec 19, 2023
Viaarxiv icon

Video Face Re-Aging: Toward Temporally Consistent Face Re-Aging

Add code
Dec 07, 2023
Viaarxiv icon