Fei Ma

FinZero: Launching Multi-modal Financial Time Series Forecast with Large Reasoning Model

Sep 10, 2025
Via arXiv

TransMPC: Transformer-based Explicit MPC with Variable Prediction Horizon

Sep 09, 2025
Via arXiv

Human Motion Video Generation: A Survey

Sep 04, 2025
Via arXiv

Contrastive Prompt Clustering for Weakly Supervised Semantic Segmentation

Aug 23, 2025
Via arXiv

Invert4TVG: A Temporal Video Grounding Framework with Inversion Tasks for Enhanced Action Understanding

Aug 10, 2025
Via arXiv

Active Multimodal Distillation for Few-shot Action Recognition

Jun 16, 2025
Via arXiv

VReST: Enhancing Reasoning in Large Vision-Language Models through Tree Search and Self-Reward Mechanism

Jun 10, 2025
Via arXiv

SatelliteFormula: Multi-Modal Symbolic Regression from Remote Sensing Imagery for Physics Discovery

Jun 06, 2025
Via arXiv

Universal Visuo-Tactile Video Understanding for Embodied Interaction

May 28, 2025
Via arXiv

Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling

May 21, 2025
Via arXiv