Kai Yang

Sherman

Adaptive Group Policy Optimization: Towards Stable Training and Token-Efficient Reasoning

Mar 20, 2025

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Mar 11, 2025

RGB-Phase Speckle: Cross-Scene Stereo 3D Reconstruction via Wrapped Pre-Normalization

Mar 08, 2025

An Algorithm Board in Neural Decoding

Feb 18, 2025

Affine Frequency Division Multiplexing: Extending OFDM for Scenario-Flexibility and Resilience

Feb 07, 2025

Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning

Dec 20, 2024

Unlocking TriLevel Learning with Level-Wise Zeroth Order Constraints: Distributed Algorithms and Provable Non-Asymptotic Convergence

Dec 10, 2024

Novelty-based Sample Reuse for Continuous Robotics Control

Oct 17, 2024

How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs

Oct 17, 2024

fastHDMI: Fast Mutual Information Estimation for High-Dimensional Data

Oct 14, 2024