Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Viet Thanh Duy Nguyen

Advances in Protein Representation Learning: Methods, Applications, and Future Directions

Mar 20, 2025

Viet Thanh Duy Nguyen, Truong-Son Hy

Abstract:Proteins are complex biomolecules that play a central role in various biological processes, making them critical targets for breakthroughs in molecular biology, medical research, and drug discovery. Deciphering their intricate, hierarchical structures, and diverse functions is essential for advancing our understanding of life at the molecular level. Protein Representation Learning (PRL) has emerged as a transformative approach, enabling the extraction of meaningful computational representations from protein data to address these challenges. In this paper, we provide a comprehensive review of PRL research, categorizing methodologies into five key areas: feature-based, sequence-based, structure-based, multimodal, and complex-based approaches. To support researchers in this rapidly evolving field, we introduce widely used databases for protein sequences, structures, and functions, which serve as essential resources for model development and evaluation. We also explore the diverse applications of these approaches in multiple domains, demonstrating their broad impact. Finally, we discuss pressing technical challenges and outline future directions to advance PRL, offering insights to inspire continued innovation in this foundational field.

Via

Access Paper or Ask Questions

Multimodal Contrastive Representation Learning in Augmented Biomedical Knowledge Graphs

Jan 03, 2025

Tien Dang, Viet Thanh Duy Nguyen, Minh Tuan Le, Truong-Son Hy

Figure 1 for Multimodal Contrastive Representation Learning in Augmented Biomedical Knowledge Graphs

Figure 2 for Multimodal Contrastive Representation Learning in Augmented Biomedical Knowledge Graphs

Figure 3 for Multimodal Contrastive Representation Learning in Augmented Biomedical Knowledge Graphs

Figure 4 for Multimodal Contrastive Representation Learning in Augmented Biomedical Knowledge Graphs

Abstract:Biomedical Knowledge Graphs (BKGs) integrate diverse datasets to elucidate complex relationships within the biomedical field. Effective link prediction on these graphs can uncover valuable connections, such as potential novel drug-disease relations. We introduce a novel multimodal approach that unifies embeddings from specialized Language Models (LMs) with Graph Contrastive Learning (GCL) to enhance intra-entity relationships while employing a Knowledge Graph Embedding (KGE) model to capture inter-entity relationships for effective link prediction. To address limitations in existing BKGs, we present PrimeKG++, an enriched knowledge graph incorporating multimodal data, including biological sequences and textual descriptions for each entity type. By combining semantic and relational information in a unified representation, our approach demonstrates strong generalizability, enabling accurate link predictions even for unseen nodes. Experimental results on PrimeKG++ and the DrugBank drug-target interaction dataset demonstrate the effectiveness and robustness of our method across diverse biomedical datasets. Our source code, pre-trained models, and data are publicly available at https://github.com/HySonLab/BioMedKG

Via

Access Paper or Ask Questions