Picture for Nilesh Jain

Nilesh Jain

SparAMX: Accelerating Compressed LLMs Token Generation on AMX-powered CPUs

Add code
Feb 18, 2025
Viaarxiv icon

Mamba-Shedder: Post-Transformer Compression for Efficient Selective Structured State Space Models

Add code
Jan 28, 2025
Viaarxiv icon

INRet: A General Framework for Accurate Retrieval of INRs for Shapes

Add code
Jan 27, 2025
Figure 1 for INRet: A General Framework for Accurate Retrieval of INRs for Shapes
Figure 2 for INRet: A General Framework for Accurate Retrieval of INRs for Shapes
Figure 3 for INRet: A General Framework for Accurate Retrieval of INRs for Shapes
Figure 4 for INRet: A General Framework for Accurate Retrieval of INRs for Shapes
Viaarxiv icon

MultiPruner: Balanced Structure Removal in Foundation Models

Add code
Jan 17, 2025
Viaarxiv icon

Enhancing Data Integrity through Provenance Tracking in Semantic Web Frameworks

Add code
Jan 12, 2025
Viaarxiv icon

Post-Training Statistical Calibration for Higher Activation Sparsity

Add code
Dec 10, 2024
Figure 1 for Post-Training Statistical Calibration for Higher Activation Sparsity
Figure 2 for Post-Training Statistical Calibration for Higher Activation Sparsity
Figure 3 for Post-Training Statistical Calibration for Higher Activation Sparsity
Figure 4 for Post-Training Statistical Calibration for Higher Activation Sparsity
Viaarxiv icon

SQFT: Low-cost Model Adaptation in Low-precision Sparse Foundation Models

Add code
Oct 01, 2024
Viaarxiv icon

Shears: Unstructured Sparsity with Neural Low-rank Adapter Search

Add code
Apr 16, 2024
Figure 1 for Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Figure 2 for Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Figure 3 for Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Figure 4 for Shears: Unstructured Sparsity with Neural Low-rank Adapter Search
Viaarxiv icon

Mem-Rec: Memory Efficient Recommendation System using Alternative Representation

Add code
May 15, 2023
Viaarxiv icon

Streaming Encoding Algorithms for Scalable Hyperdimensional Computing

Add code
Sep 28, 2022
Figure 1 for Streaming Encoding Algorithms for Scalable Hyperdimensional Computing
Figure 2 for Streaming Encoding Algorithms for Scalable Hyperdimensional Computing
Figure 3 for Streaming Encoding Algorithms for Scalable Hyperdimensional Computing
Figure 4 for Streaming Encoding Algorithms for Scalable Hyperdimensional Computing
Viaarxiv icon