Picture for Xinfeng Xie

Xinfeng Xie

Jack

Context Parallelism for Scalable Million-Token Inference

Add code
Nov 04, 2024
Viaarxiv icon

Hierarchical Structured Neural Network for Retrieval

Add code
Aug 13, 2024
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization

Add code
May 23, 2024
Viaarxiv icon

A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules

Add code
Dec 07, 2021
Figure 1 for A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules
Figure 2 for A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules
Figure 3 for A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules
Figure 4 for A Transferable Approach for Partitioning Machine Learning Models on Multi-Chip-Modules
Viaarxiv icon

FPSA: A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator Architecture

Add code
Jan 28, 2019
Figure 1 for FPSA: A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator Architecture
Figure 2 for FPSA: A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator Architecture
Figure 3 for FPSA: A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator Architecture
Figure 4 for FPSA: A Full System Stack Solution for Reconfigurable ReRAM-based NN Accelerator Architecture
Viaarxiv icon

QGAN: Quantized Generative Adversarial Networks

Add code
Jan 24, 2019
Figure 1 for QGAN: Quantized Generative Adversarial Networks
Figure 2 for QGAN: Quantized Generative Adversarial Networks
Figure 3 for QGAN: Quantized Generative Adversarial Networks
Figure 4 for QGAN: Quantized Generative Adversarial Networks
Viaarxiv icon