Jinheng Wang

Bitnet.cpp: Efficient Edge Inference for Ternary LLMs

Feb 17, 2025
Via arXiv

1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs

Oct 21, 2024
Via arXiv