
Junho Kim

DECOR: Decomposition and Projection of Text Embeddings for Text-to-Image Customization

Dec 12, 2024
Via arXiv

Look Every Frame All at Once: Video-Ma$^2$mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing

Nov 29, 2024
Via arXiv

SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis

Nov 25, 2024
Via arXiv

D-Cube: Exploiting Hyper-Features of Diffusion Model for Robust Medical Classification

Nov 17, 2024
Via arXiv

CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models

Nov 11, 2024
Via arXiv

C2A: Client-Customized Adaptation for Parameter-Efficient Federated Learning

Nov 01, 2024
Via arXiv

CleaR: Towards Robust and Generalized Parameter-Efficient Fine-Tuning for Noisy Label Learning

Oct 31, 2024
Via arXiv

MELT: Materials-aware Continued Pre-training for Language Model Adaptation to Materials Science

Oct 19, 2024
Via arXiv

Mentor-KD: Making Small Language Models Better Multi-step Reasoners

Oct 11, 2024
Via arXiv

CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models

Jun 04, 2024
Via arXiv