Taewhan Kim

ManipGPT: Is Affordance Segmentation by Large Vision Models Enough for Articulated Object Manipulation?

Dec 13, 2024
Via arXiv

IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning

Sep 26, 2024
Via arXiv