Muhammad Umer

What If We Allocate Test-Time Compute Adaptively?

Feb 01, 2026
Via arXiv

Continuous-Utility Direct Preference Optimization

Jan 31, 2026
Via arXiv

On the Fundamental Limits of LLMs at Scale

Nov 17, 2025
Via arXiv

Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey

Apr 20, 2025
Via arXiv

Resource Allocation for RIS-Assisted CoMP-NOMA Networks using Reinforcement Learning

Apr 01, 2025
Via arXiv

Intelligent Spectrum Sharing in Integrated TN-NTNs: A Hierarchical Deep Reinforcement Learning Approach

Mar 09, 2025
Via arXiv

Computation Offloading Strategies in Integrated Terrestrial and Non-Terrestrial Networks

Feb 21, 2025
Via arXiv

Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks

Jan 16, 2025
Via arXiv

RIS-Assisted Aerial Non-Terrestrial Networks: An Intelligent Synergy with Deep Reinforcement Learning

Dec 25, 2024
Via arXiv

Adversary Aware Continual Learning

Apr 27, 2023
Via arXiv