Picture for Ramyad Hadidi

Ramyad Hadidi

Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference

Add code
Jun 17, 2024
Figure 1 for Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference
Figure 2 for Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference
Figure 3 for Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference
Figure 4 for Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference
Viaarxiv icon

Network architecture search of X-ray based scientific applications

Add code
Apr 16, 2024
Viaarxiv icon

Context-Aware Task Handling in Resource-Constrained Robots with Virtualization

Add code
Apr 09, 2021
Figure 1 for Context-Aware Task Handling in Resource-Constrained Robots with Virtualization
Figure 2 for Context-Aware Task Handling in Resource-Constrained Robots with Virtualization
Figure 3 for Context-Aware Task Handling in Resource-Constrained Robots with Virtualization
Figure 4 for Context-Aware Task Handling in Resource-Constrained Robots with Virtualization
Viaarxiv icon

Reducing Inference Latency with Concurrent Architectures for Image Recognition

Add code
Nov 13, 2020
Figure 1 for Reducing Inference Latency with Concurrent Architectures for Image Recognition
Figure 2 for Reducing Inference Latency with Concurrent Architectures for Image Recognition
Figure 3 for Reducing Inference Latency with Concurrent Architectures for Image Recognition
Figure 4 for Reducing Inference Latency with Concurrent Architectures for Image Recognition
Viaarxiv icon

Edge-Tailored Perception: Fast Inferencing in-the-Edge with Efficient Model Distribution

Add code
Mar 13, 2020
Figure 1 for Edge-Tailored Perception: Fast Inferencing in-the-Edge with Efficient Model Distribution
Figure 2 for Edge-Tailored Perception: Fast Inferencing in-the-Edge with Efficient Model Distribution
Figure 3 for Edge-Tailored Perception: Fast Inferencing in-the-Edge with Efficient Model Distribution
Figure 4 for Edge-Tailored Perception: Fast Inferencing in-the-Edge with Efficient Model Distribution
Viaarxiv icon

Collaborative Execution of Deep Neural Networks on Internet of Things Devices

Add code
Jan 08, 2019
Figure 1 for Collaborative Execution of Deep Neural Networks on Internet of Things Devices
Figure 2 for Collaborative Execution of Deep Neural Networks on Internet of Things Devices
Figure 3 for Collaborative Execution of Deep Neural Networks on Internet of Things Devices
Figure 4 for Collaborative Execution of Deep Neural Networks on Internet of Things Devices
Viaarxiv icon

Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices

Add code
Mar 21, 2018
Figure 1 for Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices
Figure 2 for Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices
Figure 3 for Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices
Figure 4 for Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices
Viaarxiv icon