Picture for Bailu Ding

Bailu Ding

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Add code
Sep 16, 2024
Viaarxiv icon

Efficient Retrieval with Learned Similarities

Add code
Jul 22, 2024
Viaarxiv icon