Chiyue Wei

Helen

Hamming Attention Distillation: Binarizing Keys and Queries for Efficient Long-Context Transformers

Feb 03, 2025
Via arXiv

A Survey: Collaborative Hardware and Software Design in the Era of Large Language Models

Oct 08, 2024
Via arXiv

DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting

Aug 16, 2023
Via arXiv