Shengqiang Liu

M$^{3}$V: A multi-modal multi-view approach for Device-Directed Speech Detection

Sep 14, 2024

DSCLAP: Domain-Specific Contrastive Language-Audio Pre-Training

Sep 14, 2024

Turbo your multi-modal classification with contrastive learning

Sep 14, 2024

Point-Voxel Transformer: An Efficient Approach To 3D Deep Learning

Aug 13, 2021