Picture for Yan Song

Yan Song

ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning

Add code
Mar 12, 2025
Viaarxiv icon

ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning

Add code
Feb 22, 2025
Viaarxiv icon

From Target Tracking to Targeting Track -- Part II: Regularized Polynomial Trajectory Optimization

Add code
Feb 22, 2025
Viaarxiv icon

FIND: Fine-grained Information Density Guided Adaptive Retrieval-Augmented Generation for Disease Diagnosis

Add code
Feb 20, 2025
Viaarxiv icon

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Add code
Oct 12, 2024
Figure 1 for OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Figure 2 for OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Figure 3 for OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Figure 4 for OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
Viaarxiv icon

Efficient Reinforcement Learning with Large Language Model Priors

Add code
Oct 10, 2024
Figure 1 for Efficient Reinforcement Learning with Large Language Model Priors
Figure 2 for Efficient Reinforcement Learning with Large Language Model Priors
Figure 3 for Efficient Reinforcement Learning with Large Language Model Priors
Figure 4 for Efficient Reinforcement Learning with Large Language Model Priors
Viaarxiv icon

Prototype based Masked Audio Model for Self-Supervised Learning of Sound Event Detection

Add code
Sep 26, 2024
Viaarxiv icon

USTC-KXDIGIT System Description for ASVspoof5 Challenge

Add code
Sep 03, 2024
Figure 1 for USTC-KXDIGIT System Description for ASVspoof5 Challenge
Figure 2 for USTC-KXDIGIT System Description for ASVspoof5 Challenge
Figure 3 for USTC-KXDIGIT System Description for ASVspoof5 Challenge
Figure 4 for USTC-KXDIGIT System Description for ASVspoof5 Challenge
Viaarxiv icon

MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection

Add code
Aug 19, 2024
Viaarxiv icon

MAT-SED: AMasked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection

Add code
Aug 16, 2024
Viaarxiv icon