Ruoyu Sun

Xi'an Jiaotong-Liverpool University, School of Mathematics and Physics, Department of Financial and Actuarial Mathematics

A novel multi-agent dynamic portfolio optimization learning system based on hierarchical deep reinforcement learning

Jan 12, 2025
Via arXiv

Enabling Scalable Oversight via Self-Evolving Critic

Jan 10, 2025
Via arXiv

Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion

Dec 16, 2024
Via arXiv

An Efficient Unsupervised Framework for Convex Quadratic Programs via Deep Unrolling

Dec 02, 2024
Via arXiv

Archilles' Heel in Semi-open LLMs: Hiding Bottom against Recovery Attacks

Oct 15, 2024
Via arXiv

Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

Aug 29, 2024
Via arXiv

MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning

Jul 31, 2024
Via arXiv

Adam-mini: Use Fewer Learning Rates To Gain More

Jun 26, 2024
Via arXiv

Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization

Jun 08, 2024
Via arXiv

PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

Jun 04, 2024
Via arXiv