Picture for Gonçalo Mordido

Gonçalo Mordido

Exploring Quantization for Efficient Pre-Training of Transformer Language Models

Add code
Jul 16, 2024
Figure 1 for Exploring Quantization for Efficient Pre-Training of Transformer Language Models
Figure 2 for Exploring Quantization for Efficient Pre-Training of Transformer Language Models
Figure 3 for Exploring Quantization for Efficient Pre-Training of Transformer Language Models
Figure 4 for Exploring Quantization for Efficient Pre-Training of Transformer Language Models
Viaarxiv icon

Lookbehind Optimizer: k steps back, 1 step forward

Add code
Jul 31, 2023
Viaarxiv icon

Promoting Exploration in Memory-Augmented Adam using Critical Momenta

Add code
Jul 18, 2023
Viaarxiv icon

Sharpness-Aware Training for Accurate Inference on Noisy DNN Accelerators

Add code
Nov 18, 2022
Viaarxiv icon

Improving Meta-Learning Generalization with Activation-Based Early-Stopping

Add code
Aug 03, 2022
Figure 1 for Improving Meta-Learning Generalization with Activation-Based Early-Stopping
Figure 2 for Improving Meta-Learning Generalization with Activation-Based Early-Stopping
Figure 3 for Improving Meta-Learning Generalization with Activation-Based Early-Stopping
Figure 4 for Improving Meta-Learning Generalization with Activation-Based Early-Stopping
Viaarxiv icon

MemSE: Fast MSE Prediction for Noisy Memristor-Based DNN Accelerators

Add code
May 03, 2022
Figure 1 for MemSE: Fast MSE Prediction for Noisy Memristor-Based DNN Accelerators
Figure 2 for MemSE: Fast MSE Prediction for Noisy Memristor-Based DNN Accelerators
Figure 3 for MemSE: Fast MSE Prediction for Noisy Memristor-Based DNN Accelerators
Figure 4 for MemSE: Fast MSE Prediction for Noisy Memristor-Based DNN Accelerators
Viaarxiv icon

Compressing 1D Time-Channel Separable Convolutions using Sparse Random Ternary Matrices

Add code
Apr 02, 2021
Figure 1 for Compressing 1D Time-Channel Separable Convolutions using Sparse Random Ternary Matrices
Figure 2 for Compressing 1D Time-Channel Separable Convolutions using Sparse Random Ternary Matrices
Figure 3 for Compressing 1D Time-Channel Separable Convolutions using Sparse Random Ternary Matrices
Figure 4 for Compressing 1D Time-Channel Separable Convolutions using Sparse Random Ternary Matrices
Viaarxiv icon

Evaluating Post-Training Compression in GANs using Locality-Sensitive Hashing

Add code
Mar 22, 2021
Viaarxiv icon

Mark-Evaluate: Assessing Language Generation using Population Estimation Methods

Add code
Oct 09, 2020
Figure 1 for Mark-Evaluate: Assessing Language Generation using Population Estimation Methods
Figure 2 for Mark-Evaluate: Assessing Language Generation using Population Estimation Methods
Figure 3 for Mark-Evaluate: Assessing Language Generation using Population Estimation Methods
Figure 4 for Mark-Evaluate: Assessing Language Generation using Population Estimation Methods
Viaarxiv icon

Improving the Evaluation of Generative Models with Fuzzy Logic

Add code
Feb 03, 2020
Figure 1 for Improving the Evaluation of Generative Models with Fuzzy Logic
Figure 2 for Improving the Evaluation of Generative Models with Fuzzy Logic
Figure 3 for Improving the Evaluation of Generative Models with Fuzzy Logic
Figure 4 for Improving the Evaluation of Generative Models with Fuzzy Logic
Viaarxiv icon