Picture for Taehee Jeong

Taehee Jeong

4bit-Quantization in Vector-Embedding for RAG

Add code
Jan 17, 2025
Viaarxiv icon

Weight Block Sparsity: Training, Compilation, and AI Engine Accelerators

Add code
Jul 12, 2024
Viaarxiv icon