Picture for Nilesh Jain

Nilesh Jain

TokenButler: Token Importance is Predictable

Add code
Mar 10, 2025
Viaarxiv icon

SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs

Add code
Feb 18, 2025
Viaarxiv icon

Mamba-Shedder: Post-Transformer Compression for Efficient Selective Structured State Space Models

Add code
Jan 28, 2025
Figure 1 for Mamba-Shedder: Post-Transformer Compression for Efficient Selective Structured State Space Models
Figure 2 for Mamba-Shedder: Post-Transformer Compression for Efficient Selective Structured State Space Models
Figure 3 for Mamba-Shedder: Post-Transformer Compression for Efficient Selective Structured State Space Models
Figure 4 for Mamba-Shedder: Post-Transformer Compression for Efficient Selective Structured State Space Models
Viaarxiv icon

INRet: A General Framework for Accurate Retrieval of INRs for Shapes

Add code
Jan 27, 2025
Figure 1 for INRet: A General Framework for Accurate Retrieval of INRs for Shapes
Figure 2 for INRet: A General Framework for Accurate Retrieval of INRs for Shapes
Figure 3 for INRet: A General Framework for Accurate Retrieval of INRs for Shapes
Figure 4 for INRet: A General Framework for Accurate Retrieval of INRs for Shapes
Viaarxiv icon

MultiPruner: Balanced Structure Removal in Foundation Models

Add code
Jan 17, 2025
Figure 1 for MultiPruner: Balanced Structure Removal in Foundation Models
Figure 2 for MultiPruner: Balanced Structure Removal in Foundation Models
Figure 3 for MultiPruner: Balanced Structure Removal in Foundation Models
Figure 4 for MultiPruner: Balanced Structure Removal in Foundation Models
Viaarxiv icon

Enhancing Data Integrity through Provenance Tracking in Semantic Web Frameworks

Add code
Jan 12, 2025
Viaarxiv icon

Post-Training Statistical Calibration for Higher Activation Sparsity

Add code
Dec 10, 2024
Figure 1 for Post-Training Statistical Calibration for Higher Activation Sparsity
Figure 2 for Post-Training Statistical Calibration for Higher Activation Sparsity
Figure 3 for Post-Training Statistical Calibration for Higher Activation Sparsity
Figure 4 for Post-Training Statistical Calibration for Higher Activation Sparsity
Viaarxiv icon

SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models

Add code
Oct 01, 2024
Viaarxiv icon

Shears: Unstructured Sparsity with Neural Low-rank Adapter Search

Add code
Apr 16, 2024
Figure 1 for Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Figure 2 for Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Figure 3 for Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Figure 4 for Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Viaarxiv icon

Mem-Rec: Memory Efficient Recommendation System using Alternative Representation

Add code
May 15, 2023
Viaarxiv icon