Picture for Carlos Segura

Carlos Segura

Seeing Eye to AI: Human Alignment via Gaze-Based Response Rewards for Large Language Models

Add code
Oct 02, 2024
Figure 1 for Seeing Eye to AI: Human Alignment via Gaze-Based Response Rewards for Large Language Models
Figure 2 for Seeing Eye to AI: Human Alignment via Gaze-Based Response Rewards for Large Language Models
Figure 3 for Seeing Eye to AI: Human Alignment via Gaze-Based Response Rewards for Large Language Models
Figure 4 for Seeing Eye to AI: Human Alignment via Gaze-Based Response Rewards for Large Language Models
Viaarxiv icon

Diffusion Models for Tabular Data Imputation and Synthetic Data Generation

Add code
Jul 02, 2024
Viaarxiv icon

Future Trends in the Design of Memetic Algorithms: the Case of the Linear Ordering Problem

Add code
May 14, 2024
Viaarxiv icon

Robust Wake-Up Word Detection by Two-stage Multi-resolution Ensembles

Add code
Oct 17, 2023
Figure 1 for Robust Wake-Up Word Detection by Two-stage Multi-resolution Ensembles
Figure 2 for Robust Wake-Up Word Detection by Two-stage Multi-resolution Ensembles
Figure 3 for Robust Wake-Up Word Detection by Two-stage Multi-resolution Ensembles
Figure 4 for Robust Wake-Up Word Detection by Two-stage Multi-resolution Ensembles
Viaarxiv icon

Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning

Add code
Jan 31, 2023
Figure 1 for Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning
Figure 2 for Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning
Figure 3 for Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning
Figure 4 for Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning
Viaarxiv icon

Efficient Keyword Spotting through long-range interactions with Temporal Lambda Networks

Add code
Apr 16, 2021
Figure 1 for Efficient Keyword Spotting through long-range interactions with Temporal Lambda Networks
Figure 2 for Efficient Keyword Spotting through long-range interactions with Temporal Lambda Networks
Figure 3 for Efficient Keyword Spotting through long-range interactions with Temporal Lambda Networks
Figure 4 for Efficient Keyword Spotting through long-range interactions with Temporal Lambda Networks
Viaarxiv icon

Speech Enhancement for Wake-Up-Word detection in Voice Assistants

Add code
Jan 29, 2021
Figure 1 for Speech Enhancement for Wake-Up-Word detection in Voice Assistants
Figure 2 for Speech Enhancement for Wake-Up-Word detection in Voice Assistants
Figure 3 for Speech Enhancement for Wake-Up-Word detection in Voice Assistants
Figure 4 for Speech Enhancement for Wake-Up-Word detection in Voice Assistants
Viaarxiv icon

Enabling Zero-shot Multilingual Spoken Language Translation with Language-Specific Encoders and Decoders

Add code
Nov 02, 2020
Figure 1 for Enabling Zero-shot Multilingual Spoken Language Translation with Language-Specific Encoders and Decoders
Figure 2 for Enabling Zero-shot Multilingual Spoken Language Translation with Language-Specific Encoders and Decoders
Figure 3 for Enabling Zero-shot Multilingual Spoken Language Translation with Language-Specific Encoders and Decoders
Figure 4 for Enabling Zero-shot Multilingual Spoken Language Translation with Language-Specific Encoders and Decoders
Viaarxiv icon

Seeing and Hearing Egocentric Actions: How Much Can We Learn?

Add code
Oct 15, 2019
Figure 1 for Seeing and Hearing Egocentric Actions: How Much Can We Learn?
Figure 2 for Seeing and Hearing Egocentric Actions: How Much Can We Learn?
Figure 3 for Seeing and Hearing Egocentric Actions: How Much Can We Learn?
Figure 4 for Seeing and Hearing Egocentric Actions: How Much Can We Learn?
Viaarxiv icon

Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion

Add code
Jun 03, 2019
Figure 1 for Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion
Figure 2 for Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion
Figure 3 for Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion
Figure 4 for Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion
Viaarxiv icon