Picture for Vithursan Thangarasa

Vithursan Thangarasa

DREAM-S: Speculative Decoding with Searchable Drafting and Target-Aware Refinement for Multimodal Generation

Add code
May 30, 2026
Viaarxiv icon

DREAM-R: Multimodal Speculative Reasoning with RL-Based Refined Drafting, Precise Verification, and Fully Parallel Execution

Add code
May 27, 2026
Viaarxiv icon

CodeQuant: Unified Clustering and Quantization for Enhanced Outlier Smoothing in Low-Precision Mixture-of-Experts

Add code
Apr 12, 2026
Viaarxiv icon

DREAM: Drafting with Refined Target Features and Entropy-Adaptive Cross-Attention Fusion for Multimodal Speculative Decoding

Add code
May 25, 2025
Viaarxiv icon

MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models

Add code
May 15, 2025
Viaarxiv icon

SD$^2$: Self-Distilled Sparse Drafters

Add code
Apr 10, 2025
Viaarxiv icon

Self-Data Distillation for Recovering Quality in Pruned Large Language Models

Add code
Oct 15, 2024
Viaarxiv icon

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Add code
Apr 18, 2024
Figure 1 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 2 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 3 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 4 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Viaarxiv icon

MediSwift: Efficient Sparse Pre-trained Biomedical Language Models

Add code
Mar 01, 2024
Figure 1 for MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
Figure 2 for MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
Figure 3 for MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
Figure 4 for MediSwift: Efficient Sparse Pre-trained Biomedical Language Models
Viaarxiv icon

Sparse Iso-FLOP Transformations for Maximizing Training Efficiency

Add code
Mar 25, 2023
Figure 1 for Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Figure 2 for Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Figure 3 for Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Figure 4 for Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Viaarxiv icon