Haoxiang Shi

ESARM: 3D Emotional Speech-to-Animation via Reward Model from Automatically-Ranked Demonstrations

Nov 20, 2024

AlignCap: Aligning Speech Emotion Captioning to Human Preferences

Oct 24, 2024

Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning

May 28, 2024

RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval

May 28, 2024

RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis

May 27, 2024

CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models

May 20, 2024

EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech

Mar 17, 2024

Towards Consistency Filtering-Free Unsupervised Learning for Dense Retrieval

Aug 05, 2023

Is ChatGPT a Good NLG Evaluator? A Preliminary Study

Mar 07, 2023

GOAL: Towards Benchmarking Few-Shot Sports Game Summarization

Jul 18, 2022