Picture for Guoxin Wang

Guoxin Wang

JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation

Add code
Nov 14, 2024
Viaarxiv icon

JoyType: A Robust Design for Multilingual Visual Text Creation

Add code
Sep 26, 2024
Figure 1 for JoyType: A Robust Design for Multilingual Visual Text Creation
Figure 2 for JoyType: A Robust Design for Multilingual Visual Text Creation
Figure 3 for JoyType: A Robust Design for Multilingual Visual Text Creation
Figure 4 for JoyType: A Robust Design for Multilingual Visual Text Creation
Viaarxiv icon

MHAD: Multimodal Home Activity Dataset with Multi-Angle Videos and Synchronized Physiological Signals

Add code
Sep 14, 2024
Viaarxiv icon

ECG Biometric Authentication Using Self-Supervised Learning for IoT Edge Sensors

Add code
Sep 09, 2024
Figure 1 for ECG Biometric Authentication Using Self-Supervised Learning for IoT Edge Sensors
Figure 2 for ECG Biometric Authentication Using Self-Supervised Learning for IoT Edge Sensors
Figure 3 for ECG Biometric Authentication Using Self-Supervised Learning for IoT Edge Sensors
Figure 4 for ECG Biometric Authentication Using Self-Supervised Learning for IoT Edge Sensors
Viaarxiv icon

3SHNet: Boosting Image-Sentence Retrieval via Visual Semantic-Spatial Self-Highlighting

Add code
Apr 26, 2024
Viaarxiv icon

PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation

Add code
Mar 14, 2024
Viaarxiv icon

A Bi-Pyramid Multimodal Fusion Method for the Diagnosis of Bipolar Disorders

Add code
Jan 15, 2024
Viaarxiv icon

Unsupervised Pre-Training Using Masked Autoencoders for ECG Analysis

Add code
Oct 17, 2023
Viaarxiv icon

Multi-Dimension-Embedding-Aware Modality Fusion Transformer for Psychiatric Disorder Clasification

Add code
Oct 04, 2023
Viaarxiv icon

Kosmos-2.5: A Multimodal Literate Model

Add code
Sep 20, 2023
Viaarxiv icon