Yuchen Wang

MAR: Multi-Agent Reflexion Improves Reasoning Abilities in LLMs

Dec 23, 2025
Via arXiv

GateFusion: Hierarchical Gated Cross-Modal Fusion for Active Speaker Detection

Dec 17, 2025
Via arXiv

Dropout Prompt Learning: Towards Robust and Adaptive Vision-Language Models

Dec 08, 2025
Via arXiv

Robust Decentralized Multi-armed Bandits: From Corruption-Resilience to Byzantine-Resilience

Nov 13, 2025
Via arXiv

Deep Dubbing: End-to-End Auto-Audiobook System with Text-to-Timbre and Context-Aware Instruct-TTS

Sep 19, 2025
Via arXiv

The Sound of Risk: A Multimodal Physics-Informed Acoustic Model for Forecasting Market Volatility and Enhancing Market Interpretability

Aug 26, 2025
Via arXiv

Handling Imbalanced Pseudolabels for Vision-Language Models with Concept Alignment and Confusion-Aware Calibrated Margin

May 04, 2025
Via arXiv

DexFlow: A Unified Approach for Dexterous Hand Pose Retargeting and Interaction

May 02, 2025
Via arXiv

Multidimensional precipitation index prediction based on CNN-LSTM hybrid framework

Apr 29, 2025
Via arXiv

From Code Generation to Software Testing: AI Copilot with Context-Based RAG

Apr 02, 2025
Via arXiv