Picture for Xinhan Di

Xinhan Di

Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models

Add code
Dec 13, 2024
Viaarxiv icon

YingSound: Video-Guided Sound Effects Generation with Multi-modal Chain-of-Thought Controls

Add code
Dec 12, 2024
Viaarxiv icon

Multi-Stage Graph Learning for fMRI Analysis to Diagnose Neuro-Developmental Disorders

Add code
Oct 07, 2024
Figure 1 for Multi-Stage Graph Learning for fMRI Analysis to Diagnose Neuro-Developmental Disorders
Figure 2 for Multi-Stage Graph Learning for fMRI Analysis to Diagnose Neuro-Developmental Disorders
Figure 3 for Multi-Stage Graph Learning for fMRI Analysis to Diagnose Neuro-Developmental Disorders
Figure 4 for Multi-Stage Graph Learning for fMRI Analysis to Diagnose Neuro-Developmental Disorders
Viaarxiv icon

OCC-MLLM-Alpha:Empowering Multi-modal Large Language Model for the Understanding of Occluded Objects with Self-Supervised Test-Time Learning

Add code
Oct 02, 2024
Viaarxiv icon

OCC-MLLM:Empowering Multimodal Large Language Model For the Understanding of Occluded Objects

Add code
Oct 02, 2024
Viaarxiv icon

Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation

Add code
Oct 01, 2024
Viaarxiv icon

Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation

Add code
Sep 26, 2024
Figure 1 for Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation
Figure 2 for Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation
Figure 3 for Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation
Figure 4 for Self-Supervised Learning of Deviation in Latent Representation for Co-speech Gesture Video Generation
Viaarxiv icon

Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation

Add code
Aug 01, 2024
Figure 1 for Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation
Figure 2 for Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation
Figure 3 for Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation
Figure 4 for Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation
Viaarxiv icon

Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes

Add code
Oct 19, 2022
Figure 1 for Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes
Figure 2 for Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes
Figure 3 for Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes
Figure 4 for Hierarchical Reinforcement Learning for Furniture Layout in Virtual Indoor Scenes
Viaarxiv icon

LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction

Add code
Aug 27, 2022
Figure 1 for LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction
Figure 2 for LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction
Figure 3 for LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction
Figure 4 for LWA-HAND: Lightweight Attention Hand for Interacting Hand Reconstruction
Viaarxiv icon