Souvik Kundu

LANTERN++: Enhanced Relaxed Speculative Decoding with Static Tree Drafting for Visual Auto-regressive Models

Feb 10, 2025

CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing

Feb 04, 2025

Unraveling Zeroth-Order Optimization through the Lens of Low-Dimensional Structured Perturbations

Jan 31, 2025

AttentionBreaker: Adaptive Evolutionary Optimization for Unmasking Vulnerabilities in LLMs through Bit-Flip Attacks

Nov 21, 2024

MicroScopiQ: Accelerating Foundational Models through Outlier-Aware Microscaling Quantization

Nov 08, 2024

LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding

Oct 04, 2024

Understanding the Performance and Estimating the Cost of LLM Fine-Tuning

Aug 08, 2024

MaskVD: Region Masking for Efficient Video Object Detection

Jul 16, 2024

Metron: Holistic Performance Evaluation Framework for LLM Inference Systems

Jul 09, 2024

CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs

Jul 07, 2024