Picture for Andrey Guzhov

Andrey Guzhov

AudioCLIP: Extending CLIP to Image, Text and Audio

Add code
Jun 24, 2021
Figure 1 for AudioCLIP: Extending CLIP to Image, Text and Audio
Figure 2 for AudioCLIP: Extending CLIP to Image, Text and Audio
Figure 3 for AudioCLIP: Extending CLIP to Image, Text and Audio
Figure 4 for AudioCLIP: Extending CLIP to Image, Text and Audio
Viaarxiv icon

ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio

Add code
Apr 23, 2021
Figure 1 for ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio
Figure 2 for ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio
Figure 3 for ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio
Figure 4 for ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio
Viaarxiv icon

ESResNet: Environmental Sound Classification Based on Visual Domain Models

Add code
Apr 15, 2020
Figure 1 for ESResNet: Environmental Sound Classification Based on Visual Domain Models
Figure 2 for ESResNet: Environmental Sound Classification Based on Visual Domain Models
Figure 3 for ESResNet: Environmental Sound Classification Based on Visual Domain Models
Figure 4 for ESResNet: Environmental Sound Classification Based on Visual Domain Models
Viaarxiv icon