Picture for Dianhai Yu

Dianhai Yu

FlashMask: Efficient and Rich Mask Extension of FlashAttention

Add code
Oct 02, 2024
Figure 1 for FlashMask: Efficient and Rich Mask Extension of FlashAttention
Figure 2 for FlashMask: Efficient and Rich Mask Extension of FlashAttention
Figure 3 for FlashMask: Efficient and Rich Mask Extension of FlashAttention
Figure 4 for FlashMask: Efficient and Rich Mask Extension of FlashAttention
Viaarxiv icon

NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time

Add code
Aug 07, 2024
Viaarxiv icon

A Framework for Cost-Effective and Self-Adaptive LLM Shaking and Recovery Mechanism

Add code
Mar 12, 2024
Viaarxiv icon

Spectral Heterogeneous Graph Convolutions via Positive Noncommutative Polynomials

Add code
May 31, 2023
Viaarxiv icon

Label Information Enhanced Fraud Detection against Low Homophily in Graphs

Add code
Feb 21, 2023
Viaarxiv icon

TA-MoE: Topology-Aware Large Scale Mixture-of-Expert Training

Add code
Feb 20, 2023
Viaarxiv icon

PP-YOLOE-R: An Efficient Anchor-Free Rotated Object Detector

Add code
Nov 04, 2022
Viaarxiv icon

PP-StructureV2: A Stronger Document Analysis System

Add code
Oct 11, 2022
Figure 1 for PP-StructureV2: A Stronger Document Analysis System
Figure 2 for PP-StructureV2: A Stronger Document Analysis System
Figure 3 for PP-StructureV2: A Stronger Document Analysis System
Figure 4 for PP-StructureV2: A Stronger Document Analysis System
Viaarxiv icon

ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding

Add code
Sep 18, 2022
Figure 1 for ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Figure 2 for ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Figure 3 for ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Figure 4 for ERNIE-mmLayout: Multi-grained MultiModal Transformer for Document Understanding
Viaarxiv icon

Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources

Add code
Jul 14, 2022
Figure 1 for Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources
Figure 2 for Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources
Figure 3 for Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources
Figure 4 for Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources
Viaarxiv icon