Junjie Fei

Kestrel: Point Grounding Multimodal LLM for Part-Aware 3D Vision-Language Understanding

May 29, 2024
Via arXiv

Transferable Decoding with Visual Entities for Zero-Shot Image Captioning

Jul 31, 2023
Via arXiv

Caption Anything: Interactive Image Description with Diverse Multimodal Controls

May 08, 2023
Via arXiv