Xiyuan Gao

SarcasmMiner: A Dual-Track Post-Training Framework for Robust Audio-Visual Sarcasm Reasoning

Mar 05, 2026
Via arXiv

Making Machines Sound Sarcastic: LLM-Enhanced and Retrieval-Guided Sarcastic Speech Synthesis

Oct 08, 2025
Via arXiv

Spoken in Jest, Detected in Earnest: A Systematic Review of Sarcasm Recognition -- Multimodal Fusion, Challenges, and Future Prospects

Sep 04, 2025
Via arXiv

Asymmetric Reinforcing against Multi-modal Representation Bias

Jan 02, 2025
Via arXiv

AMuSeD: An Attentive Deep Neural Network for Multimodal Sarcasm Detection Incorporating Bi-modal Data Augmentation

Dec 13, 2024
Via arXiv

A Functional Trade-off between Prosodic and Semantic Cues in Conveying Sarcasm

Aug 27, 2024
Via arXiv

Contextual Learning in Fourier Complex Field for VHR Remote Sensing Images

Oct 28, 2022
Via arXiv