Jiahao Zhao

Skip-Connected Policy Optimization for Implicit Advantage

Apr 09, 2026
Via arXiv

SC-Arena: A Natural Language Benchmark for Single-Cell Reasoning with Knowledge-Augmented Evaluation

Feb 26, 2026
Via arXiv

DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents

Feb 03, 2026
Via arXiv

A Step to Decouple Optimization in 3DGS

Jan 26, 2026
Via arXiv

Cognitive-YOLO: LLM-Driven Architecture Synthesis from First Principles of Data for Object Detection

Dec 13, 2025
Via arXiv

RxSafeBench: Identifying Medication Safety Issues of Large Language Models in Simulated Consultation

Nov 06, 2025
Via arXiv

Jinx: Unlimited LLMs for Probing Alignment Failures

Aug 12, 2025
Via arXiv

R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

May 22, 2025
Via arXiv

Tight Gap-Dependent Memory-Regret Trade-Off for Single-Pass Streaming Stochastic Multi-Armed Bandits

Mar 04, 2025
Via arXiv

Streaming Piano Transcription Based on Consistent Onset and Offset Decoding with Sustain Pedal Detection

Mar 03, 2025
Via arXiv