Picture for Biqing Qi

Biqing Qi

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Add code
Feb 10, 2025
Viaarxiv icon

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Add code
Dec 23, 2024
Viaarxiv icon

Fast and Slow Gradient Approximation for Binary Neural Network Optimization

Add code
Dec 16, 2024
Figure 1 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Figure 2 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Figure 3 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Figure 4 for Fast and Slow Gradient Approximation for Binary Neural Network Optimization
Viaarxiv icon

Less is More: Efficient Model Merging with Binary Task Switch

Add code
Nov 24, 2024
Viaarxiv icon

An Efficient Memory Module for Graph Few-Shot Class-Incremental Learning

Add code
Nov 11, 2024
Viaarxiv icon

On the token distance modeling ability of higher RoPE attention dimension

Add code
Oct 11, 2024
Figure 1 for On the token distance modeling ability of higher RoPE attention dimension
Figure 2 for On the token distance modeling ability of higher RoPE attention dimension
Figure 3 for On the token distance modeling ability of higher RoPE attention dimension
Figure 4 for On the token distance modeling ability of higher RoPE attention dimension
Viaarxiv icon

MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making

Add code
Sep 25, 2024
Viaarxiv icon

SR-CIS: Self-Reflective Incremental System with Decoupled Memory and Reasoning

Add code
Aug 04, 2024
Viaarxiv icon

Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking

Add code
Jul 19, 2024
Viaarxiv icon

Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation

Add code
Jul 12, 2024
Figure 1 for Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Figure 2 for Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Figure 3 for Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Figure 4 for Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Viaarxiv icon