Picture for Junqi Gao

Junqi Gao

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Add code
Apr 01, 2025
Viaarxiv icon

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Add code
Feb 10, 2025
Viaarxiv icon

Fast and Slow Gradient Approximation for Binary Neural Network Optimization

Add code
Dec 16, 2024
Figure 1 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Figure 2 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Figure 3 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Figure 4 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Viaarxiv icon

Less is More: Efficient Model Merging with Binary Task Switch

Add code
Nov 24, 2024
Viaarxiv icon

An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning

Add code
Nov 11, 2024
Viaarxiv icon

SR-CIS: Self-Reflective Incremental System with Decoupled Memory and Reasoning

Add code
Aug 04, 2024
Viaarxiv icon

Perturbation Towards Easy Samples Improves Targeted Adversarial Transferability

Add code
Jun 08, 2024
Viaarxiv icon

Enhancing Adversarial Transferability via Information Bottleneck Constraints

Add code
Jun 08, 2024
Figure 1 for Enhancing Adversarial Transferability via Information Bottleneck Constraints
Figure 2 for Enhancing Adversarial Transferability via Information Bottleneck Constraints
Figure 3 for Enhancing Adversarial Transferability via Information Bottleneck Constraints
Figure 4 for Enhancing Adversarial Transferability via Information Bottleneck Constraints
Viaarxiv icon

Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing

Add code
Jun 08, 2024
Viaarxiv icon

SMR: State Memory Replay for Long Sequence Modeling

Add code
May 27, 2024
Viaarxiv icon