Picture for Benjia Zhou

Benjia Zhou

C${^2}$RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval

Add code
Aug 19, 2024
Figure 1 for C${^2}$RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval
Figure 2 for C${^2}$RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval
Figure 3 for C${^2}$RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval
Figure 4 for C${^2}$RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval
Viaarxiv icon

Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation

Add code
Mar 19, 2024
Viaarxiv icon

PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features

Add code
Dec 05, 2023
Figure 1 for PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features
Figure 2 for PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features
Figure 3 for PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features
Figure 4 for PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features
Viaarxiv icon

Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition

Add code
Sep 11, 2023
Viaarxiv icon

Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining

Add code
Jul 27, 2023
Viaarxiv icon

A Unified Multimodal De- and Re-coupling Framework for RGB-D Motion Recognition

Add code
Nov 16, 2022
Viaarxiv icon

Effective Vision Transformer Training: A Data-Centric Perspective

Add code
Sep 29, 2022
Figure 1 for Effective Vision Transformer Training: A Data-Centric Perspective
Figure 2 for Effective Vision Transformer Training: A Data-Centric Perspective
Figure 3 for Effective Vision Transformer Training: A Data-Centric Perspective
Figure 4 for Effective Vision Transformer Training: A Data-Centric Perspective
Viaarxiv icon

Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition

Add code
Dec 16, 2021
Figure 1 for Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition
Figure 2 for Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition
Figure 3 for Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition
Figure 4 for Decoupling and Recoupling Spatiotemporal Representation for RGB-D-based Motion Recognition
Viaarxiv icon

Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition

Add code
Mar 09, 2021
Figure 1 for Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition
Figure 2 for Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition
Figure 3 for Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition
Figure 4 for Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition
Viaarxiv icon

DSAM: A Distance Shrinking with Angular Marginalizing Loss for High Performance Vehicle Re-identificatio

Add code
Nov 25, 2020
Figure 1 for DSAM: A Distance Shrinking with Angular Marginalizing Loss for High Performance Vehicle Re-identificatio
Figure 2 for DSAM: A Distance Shrinking with Angular Marginalizing Loss for High Performance Vehicle Re-identificatio
Figure 3 for DSAM: A Distance Shrinking with Angular Marginalizing Loss for High Performance Vehicle Re-identificatio
Figure 4 for DSAM: A Distance Shrinking with Angular Marginalizing Loss for High Performance Vehicle Re-identificatio
Viaarxiv icon