Picture for Jitesh Jain

Jitesh Jain

OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation

Add code
Dec 12, 2024
Viaarxiv icon

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

Add code
May 09, 2024
Figure 1 for CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Figure 2 for CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Figure 3 for CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Figure 4 for CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
Viaarxiv icon

Benchmarking Object Detectors with COCO: A New Path Forward

Add code
Mar 27, 2024
Viaarxiv icon

VCoder: Versatile Vision Encoders for Multimodal Large Language Models

Add code
Dec 21, 2023
Viaarxiv icon

Matting Anything

Add code
Jun 08, 2023
Viaarxiv icon

OneFormer: One Transformer to Rule Universal Image Segmentation

Add code
Nov 10, 2022
Viaarxiv icon

Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand

Add code
Aug 05, 2022
Viaarxiv icon

SeMask: Semantically Masked Transformers for Semantic Segmentation

Add code
Dec 23, 2021
Figure 1 for SeMask: Semantically Masked Transformers for Semantic Segmentation
Figure 2 for SeMask: Semantically Masked Transformers for Semantic Segmentation
Figure 3 for SeMask: Semantically Masked Transformers for Semantic Segmentation
Figure 4 for SeMask: Semantically Masked Transformers for Semantic Segmentation
Viaarxiv icon

DEAP Cache: Deep Eviction Admission and Prefetching for Cache

Add code
Sep 19, 2020
Figure 1 for DEAP Cache: Deep Eviction Admission and Prefetching for Cache
Figure 2 for DEAP Cache: Deep Eviction Admission and Prefetching for Cache
Figure 3 for DEAP Cache: Deep Eviction Admission and Prefetching for Cache
Viaarxiv icon