Picture for Bowen Zhou

Bowen Zhou

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Add code
Feb 10, 2025
Viaarxiv icon

Process Reinforcement through Implicit Rewards

Add code
Feb 03, 2025
Viaarxiv icon

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Add code
Jan 30, 2025
Viaarxiv icon

The 1st SpeechWellness Challenge: Detecting Suicidal Risk Among Adolescents

Add code
Jan 11, 2025
Viaarxiv icon

Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback

Add code
Jan 07, 2025
Figure 1 for Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback
Figure 2 for Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback
Figure 3 for Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback
Figure 4 for Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback
Viaarxiv icon

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Add code
Dec 23, 2024
Viaarxiv icon

How to Synthesize Text Data without Model Collapse?

Add code
Dec 19, 2024
Figure 1 for How to Synthesize Text Data without Model Collapse?
Figure 2 for How to Synthesize Text Data without Model Collapse?
Figure 3 for How to Synthesize Text Data without Model Collapse?
Figure 4 for How to Synthesize Text Data without Model Collapse?
Viaarxiv icon

Free Process Rewards without Process Labels

Add code
Dec 02, 2024
Figure 1 for Free Process Rewards without Process Labels
Figure 2 for Free Process Rewards without Process Labels
Figure 3 for Free Process Rewards without Process Labels
Figure 4 for Free Process Rewards without Process Labels
Viaarxiv icon

Less is More: Efficient Model Merging with Binary Task Switch

Add code
Nov 24, 2024
Viaarxiv icon

Automating Exploratory Proteomics Research via Language Models

Add code
Nov 06, 2024
Viaarxiv icon