Picture for Yusuke Sugano

Yusuke Sugano

Domain-Adaptive Full-Face Gaze Estimation via Novel-View-Synthesis and Feature Disentanglement

Add code
May 25, 2023
Viaarxiv icon

Rotation-Constrained Cross-View Feature Fusion for Multi-View Appearance-based Gaze Estimation

Add code
May 22, 2023
Viaarxiv icon

Learning Video-independent Eye Contact Segmentation from In-the-Wild Videos

Add code
Oct 05, 2022
Figure 1 for Learning Video-independent Eye Contact Segmentation from In-the-Wild Videos
Figure 2 for Learning Video-independent Eye Contact Segmentation from In-the-Wild Videos
Figure 3 for Learning Video-independent Eye Contact Segmentation from In-the-Wild Videos
Figure 4 for Learning Video-independent Eye Contact Segmentation from In-the-Wild Videos
Viaarxiv icon

Learning-by-Novel-View-Synthesis for Full-Face Appearance-based 3D Gaze Estimation

Add code
Jan 23, 2022
Figure 1 for Learning-by-Novel-View-Synthesis for Full-Face Appearance-based 3D Gaze Estimation
Figure 2 for Learning-by-Novel-View-Synthesis for Full-Face Appearance-based 3D Gaze Estimation
Figure 3 for Learning-by-Novel-View-Synthesis for Full-Face Appearance-based 3D Gaze Estimation
Figure 4 for Learning-by-Novel-View-Synthesis for Full-Face Appearance-based 3D Gaze Estimation
Viaarxiv icon

Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips

Add code
Dec 02, 2021
Figure 1 for Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips
Figure 2 for Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips
Figure 3 for Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips
Figure 4 for Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips
Viaarxiv icon

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Add code
Oct 13, 2021
Figure 1 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 2 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 3 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 4 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Viaarxiv icon

EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021: Team M3EM Technical Report

Add code
Jul 01, 2021
Figure 1 for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021: Team M3EM Technical Report
Figure 2 for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021: Team M3EM Technical Report
Figure 3 for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021: Team M3EM Technical Report
Figure 4 for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021: Team M3EM Technical Report
Viaarxiv icon

DRIV100: In-The-Wild Multi-Domain Dataset and Evaluation for Real-World Domain Adaptation of Semantic Segmentation

Add code
Feb 25, 2021
Figure 1 for DRIV100: In-The-Wild Multi-Domain Dataset and Evaluation for Real-World Domain Adaptation of Semantic Segmentation
Figure 2 for DRIV100: In-The-Wild Multi-Domain Dataset and Evaluation for Real-World Domain Adaptation of Semantic Segmentation
Figure 3 for DRIV100: In-The-Wild Multi-Domain Dataset and Evaluation for Real-World Domain Adaptation of Semantic Segmentation
Figure 4 for DRIV100: In-The-Wild Multi-Domain Dataset and Evaluation for Real-World Domain Adaptation of Semantic Segmentation
Viaarxiv icon

Shape-conditioned Image Generation by Learning Latent Appearance Representation from Unpaired Data

Add code
Nov 29, 2018
Figure 1 for Shape-conditioned Image Generation by Learning Latent Appearance Representation from Unpaired Data
Figure 2 for Shape-conditioned Image Generation by Learning Latent Appearance Representation from Unpaired Data
Figure 3 for Shape-conditioned Image Generation by Learning Latent Appearance Representation from Unpaired Data
Figure 4 for Shape-conditioned Image Generation by Learning Latent Appearance Representation from Unpaired Data
Viaarxiv icon

A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks

Add code
May 11, 2018
Figure 1 for A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks
Figure 2 for A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks
Figure 3 for A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks
Figure 4 for A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks
Viaarxiv icon