Raghuraman Krishnamoorthi

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Oct 22, 2024
Via arXiv

Agent-as-a-Judge: Evaluate Agents with Agents

Oct 14, 2024
Via arXiv

SpinQuant: LLM quantization with learned rotations

May 28, 2024
Via arXiv

Communication Efficient Distributed Training with Distributed Lion

Mar 30, 2024
Via arXiv

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Feb 22, 2024
Via arXiv

SqueezeSAM: User friendly mobile interactive segmentation

Dec 11, 2023
Via arXiv

Gen2Det: Generate to Detect

Dec 07, 2023
Via arXiv

Diversify, Don't Fine-Tune: Scaling Up Visual Recognition Training with Synthetic Images

Dec 04, 2023
Via arXiv

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Dec 01, 2023
Via arXiv

MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning

Oct 26, 2023
Via arXiv