Picture for Jongse Park

Jongse Park

LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale

Add code
Aug 10, 2024
Viaarxiv icon

DaCapo: Accelerating Continuous Learning in Autonomous Systems for Video Analytics

Add code
Mar 21, 2024
Viaarxiv icon

Accelerating String-Key Learned Index Structures via Memoization-based Incremental Training

Add code
Mar 18, 2024
Viaarxiv icon

CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics

Add code
Jul 02, 2022
Figure 1 for CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics
Figure 2 for CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics
Figure 3 for CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics
Figure 4 for CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics
Viaarxiv icon

FlexBlock: A Flexible DNN Training Accelerator with Multi-Mode Block Floating Point Support

Add code
Mar 13, 2022
Figure 1 for FlexBlock: A Flexible DNN Training Accelerator with Multi-Mode Block Floating Point Support
Figure 2 for FlexBlock: A Flexible DNN Training Accelerator with Multi-Mode Block Floating Point Support
Figure 3 for FlexBlock: A Flexible DNN Training Accelerator with Multi-Mode Block Floating Point Support
Figure 4 for FlexBlock: A Flexible DNN Training Accelerator with Multi-Mode Block Floating Point Support
Viaarxiv icon

Multi-model Machine Learning Inference Serving with GPU Spatial Partitioning

Add code
Sep 01, 2021
Figure 1 for Multi-model Machine Learning Inference Serving with GPU Spatial Partitioning
Figure 2 for Multi-model Machine Learning Inference Serving with GPU Spatial Partitioning
Figure 3 for Multi-model Machine Learning Inference Serving with GPU Spatial Partitioning
Figure 4 for Multi-model Machine Learning Inference Serving with GPU Spatial Partitioning
Viaarxiv icon

Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks

Add code
May 30, 2018
Figure 1 for Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks
Figure 2 for Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks
Figure 3 for Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks
Figure 4 for Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks
Viaarxiv icon