Picture for Keitaro Tanaka

Keitaro Tanaka

Formula-Supervised Sound Event Detection: Pre-Training Without Real Data

Add code
Apr 06, 2025
Viaarxiv icon

SyncViolinist: Music-Oriented Violin Motion Generation Based on Bowing and Fingering

Add code
Dec 11, 2024
Figure 1 for SyncViolinist: Music-Oriented Violin Motion Generation Based on Bowing and Fingering
Figure 2 for SyncViolinist: Music-Oriented Violin Motion Generation Based on Bowing and Fingering
Figure 3 for SyncViolinist: Music-Oriented Violin Motion Generation Based on Bowing and Fingering
Figure 4 for SyncViolinist: Music-Oriented Violin Motion Generation Based on Bowing and Fingering
Viaarxiv icon

Gaze-Driven Sentence Simplification for Language Learners: Enhancing Comprehension and Readability

Add code
Sep 30, 2023
Figure 1 for Gaze-Driven Sentence Simplification for Language Learners: Enhancing Comprehension and Readability
Figure 2 for Gaze-Driven Sentence Simplification for Language Learners: Enhancing Comprehension and Readability
Figure 3 for Gaze-Driven Sentence Simplification for Language Learners: Enhancing Comprehension and Readability
Figure 4 for Gaze-Driven Sentence Simplification for Language Learners: Enhancing Comprehension and Readability
Viaarxiv icon

Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction

Add code
Jun 10, 2023
Figure 1 for Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction
Figure 2 for Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction
Figure 3 for Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction
Figure 4 for Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction
Viaarxiv icon

Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning

Add code
May 23, 2023
Figure 1 for Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning
Figure 2 for Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning
Figure 3 for Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning
Figure 4 for Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning
Viaarxiv icon

Memory Efficient Diffusion Probabilistic Models via Patch-based Generation

Add code
Apr 14, 2023
Viaarxiv icon

Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex

Add code
Jun 16, 2021
Figure 1 for Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex
Figure 2 for Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex
Figure 3 for Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex
Figure 4 for Manifold-Aware Deep Clustering: Maximizing Angles between Embedding Vectors Based on Regular Simplex
Viaarxiv icon