Puneet Gupta

Beyond Universality: The GCC-FER Dataset and Culture-Aware Adaptation for Dynamic Facial Expression Recognition

Jun 05, 2026
Via arXiv

Exploring Remote Photoplethysmography for Neonatal Pain Detection from Facial Videos

Apr 28, 2026
Via arXiv

Efficient VQ-QAT and Mixed Vector/Linear quantized Neural Networks

Apr 25, 2026
Via arXiv

Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference

Jul 19, 2024
Via arXiv

FRED: Flexible REduction-Distribution Interconnect and Communication Implementation for Wafer-Scale Distributed Training of DNN Models

Jun 28, 2024
Via arXiv

Cost-Driven Hardware-Software Co-Optimization of Machine Learning Pipelines

Oct 19, 2023
Via arXiv

Training Neural Networks for Execution on Approximate Hardware

Apr 08, 2023
Via arXiv

PhotoFourier: A Photonic Joint Transform Correlator-Based Neural Network Accelerator

Nov 10, 2022
Via arXiv

Bit-serial Weight Pools: Compression and Arbitrary Precision Execution of Neural Networks on Resource Constrained Processors

Jan 25, 2022
Via arXiv

Batch Processing and Data Streaming Fourier-based Convolutional Neural Network Accelerator

Dec 23, 2021
Via arXiv