Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Boosting Protein Language Models with Negative Sample Mining

May 28, 2024

Yaoyao Xu, Xinjian Zhao, Xiaozhuang Song, Benyou Wang, Tianshu Yu

Figure 1 for Boosting Protein Language Models with Negative Sample Mining

Figure 2 for Boosting Protein Language Models with Negative Sample Mining

Figure 3 for Boosting Protein Language Models with Negative Sample Mining

Figure 4 for Boosting Protein Language Models with Negative Sample Mining

Share this with someone who'll enjoy it:

Abstract:We introduce a pioneering methodology for boosting large language models in the domain of protein representation learning. Our primary contribution lies in the refinement process for correlating the over-reliance on co-evolution knowledge, in a way that networks are trained to distill invaluable insights from negative samples, constituted by protein pairs sourced from disparate categories. By capitalizing on this novel approach, our technique steers the training of transformer-based models within the attention score space. This advanced strategy not only amplifies performance but also reflects the nuanced biological behaviors exhibited by proteins, offering aligned evidence with traditional biological mechanisms such as protein-protein interaction. We experimentally observed improved performance on various tasks over datasets, on top of several well-established large protein models. This innovative paradigm opens up promising horizons for further progress in the realms of protein research and computational biology.

* 17 pages, 4 figures

View paper on

Share this with someone who'll enjoy it:

Title:Boosting Protein Language Models with Negative Sample Mining

Paper and Code