Picture for Yiren Zhao

Yiren Zhao

Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference

Add code
Jan 29, 2026
Viaarxiv icon

CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

Add code
Jan 14, 2026
Viaarxiv icon

On the Existence and Behaviour of Secondary Attention Sinks

Add code
Dec 22, 2025
Viaarxiv icon

Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization

Add code
Jun 16, 2025
Figure 1 for Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization
Figure 2 for Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization
Figure 3 for Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization
Figure 4 for Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization
Viaarxiv icon

A3 : an Analytical Low-Rank Approximation Framework for Attention

Add code
May 19, 2025
Viaarxiv icon

AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks

Add code
Feb 28, 2025
Figure 1 for AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks
Figure 2 for AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks
Figure 3 for AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks
Figure 4 for AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks
Viaarxiv icon

ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments

Add code
Feb 28, 2025
Figure 1 for ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments
Figure 2 for ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments
Figure 3 for ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments
Figure 4 for ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments
Viaarxiv icon

Cached Multi-Lora Composition for Multi-Concept Image Generation

Add code
Feb 07, 2025
Viaarxiv icon

Omni-DNA: A Unified Genomic Foundation Model for Cross-Modal and Multi-Task Learning

Add code
Feb 05, 2025
Figure 1 for Omni-DNA: A Unified Genomic Foundation Model for Cross-Modal and Multi-Task Learning
Figure 2 for Omni-DNA: A Unified Genomic Foundation Model for Cross-Modal and Multi-Task Learning
Figure 3 for Omni-DNA: A Unified Genomic Foundation Model for Cross-Modal and Multi-Task Learning
Figure 4 for Omni-DNA: A Unified Genomic Foundation Model for Cross-Modal and Multi-Task Learning
Viaarxiv icon

Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models

Add code
Dec 18, 2024
Figure 1 for Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models
Figure 2 for Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models
Figure 3 for Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models
Figure 4 for Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models
Viaarxiv icon