Picture for Kaiyan Zhang

Kaiyan Zhang

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Add code
Dec 23, 2024
Viaarxiv icon

How to Synthesize Text Data without Model Collapse?

Add code
Dec 19, 2024
Figure 1 for How to Synthesize Text Data without Model Collapse?
Figure 2 for How to Synthesize Text Data without Model Collapse?
Figure 3 for How to Synthesize Text Data without Model Collapse?
Figure 4 for How to Synthesize Text Data without Model Collapse?
Viaarxiv icon

Free Process Rewards without Process Labels

Add code
Dec 02, 2024
Figure 1 for Free Process Rewards without Process Labels
Figure 2 for Free Process Rewards without Process Labels
Figure 3 for Free Process Rewards without Process Labels
Figure 4 for Free Process Rewards without Process Labels
Viaarxiv icon

Automating Exploratory Proteomics Research via Language Models

Add code
Nov 06, 2024
Viaarxiv icon

Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention

Add code
Nov 04, 2024
Viaarxiv icon

A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation

Add code
Oct 28, 2024
Viaarxiv icon

A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation

Add code
Oct 26, 2024
Figure 1 for A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
Figure 2 for A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
Figure 3 for A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
Figure 4 for A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation
Viaarxiv icon

Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices

Add code
Oct 15, 2024
Figure 1 for Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Figure 2 for Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Figure 3 for Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Figure 4 for Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Viaarxiv icon

Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation

Add code
Jul 12, 2024
Figure 1 for Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Figure 2 for Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Figure 3 for Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Figure 4 for Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
Viaarxiv icon

Towards Building Specialized Generalist AI with System 1 and System 2 Fusion

Add code
Jul 11, 2024
Viaarxiv icon