Abstract: Understanding the dynamic nature of protein structures is essential for comprehending their biological functions. While significant progress has been made in predicting static folded structures, modeling protein motions on microsecond to millisecond scales remains challenging. To address this challenge, we introduce a novel deep learning architecture, Protein Transformer with Scattering, Attention, and Positional Embedding (ProtSCAPE), which leverages the geometric scattering transform alongside transformer-based attention mechanisms to capture protein dynamics from molecular dynamics (MD) simulations. ProtSCAPE utilizes the multi-scale nature of the geometric scattering transform to extract features from protein structures conceptualized as graphs and integrates these features with dual attention structures that focus on residues and amino acid signals, generating latent representations of protein trajectories. Furthermore, ProtSCAPE incorporates a regression head to enforce temporally coherent latent representations.
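To make the scattering-plus-attention pipeline concrete, below is a minimal PyTorch sketch, not the released ProtSCAPE implementation: first-order geometric scattering over a residue graph, a single attention block in place of the paper's dual attention, and a regression head on the trajectory time index. All names, layer sizes, and hyperparameters here are illustrative assumptions.

```python
import torch
import torch.nn as nn

def scattering_features(A, x, num_scales=4):
    """First-order geometric scattering on a graph (minimal sketch).

    A: (n, n) adjacency of the residue graph; x: (n, d) node signals.
    Builds diffusion wavelets Psi_j = P^(2^(j-1)) - P^(2^j) from the lazy
    random walk P = (I + D^-1 A) / 2 and applies a modulus nonlinearity.
    """
    n = A.shape[0]
    deg = A.sum(dim=1).clamp(min=1e-8)
    P = 0.5 * (torch.eye(n) + A / deg.unsqueeze(1))   # lazy random walk
    powers, Pk = [torch.eye(n)], torch.eye(n)
    for _ in range(2 ** num_scales):                  # precompute P^k up to 2^J
        Pk = Pk @ P
        powers.append(Pk)
    feats = [x]                                       # zeroth-order term
    for j in range(1, num_scales + 1):
        psi = powers[2 ** (j - 1)] - powers[2 ** j]   # wavelet at scale j
        feats.append((psi @ x).abs())                 # |Psi_j x|
    return torch.cat(feats, dim=-1)                   # (n, d * (num_scales + 1))

class ProtSCAPESketch(nn.Module):
    """Attention over residues plus a time-regression head (a stand-in for
    the paper's dual attention and temporal-coherence regression)."""
    def __init__(self, in_dim, hidden=64, heads=4):
        super().__init__()
        self.proj = nn.Linear(in_dim, hidden)
        self.attn = nn.MultiheadAttention(hidden, heads, batch_first=True)
        self.time_head = nn.Linear(hidden, 1)         # predicts position along the trajectory

    def forward(self, feats):                         # feats: (n_residues, in_dim)
        h = self.proj(feats).unsqueeze(0)             # (1, n, hidden)
        h, _ = self.attn(h, h, h)                     # residue-level attention
        z = h.mean(dim=1)                             # latent code for this MD frame
        return z, self.time_head(z)
```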
Abstract: The development of powerful natural language models has increased the ability to learn meaningful representations of protein sequences. In addition, advances in high-throughput mutagenesis, directed evolution, and next-generation sequencing have allowed for the accumulation of large amounts of labeled fitness data. Leveraging these two trends, we introduce Regularized Latent Space Optimization (ReLSO), a deep transformer-based autoencoder trained to jointly generate sequences and predict fitness. Using ReLSO, we explicitly model the underlying sequence-function landscape of large labeled datasets and optimize within latent space using gradient-based methods. Through regularized prediction heads, ReLSO introduces a powerful protein sequence encoder and a novel approach for efficient fitness landscape traversal.
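The loop this enables, encode a sequence, predict its fitness, and ascend the fitness gradient in latent space, can be sketched as follows. This is an illustrative toy rather than the ReLSO codebase; the architecture, names, and hyperparameters are assumptions.

```python
import torch
import torch.nn as nn

class JointAutoencoder(nn.Module):
    """Toy ReLSO-style model: reconstruct a sequence and predict its
    fitness from the same latent code (architecture is illustrative)."""
    def __init__(self, vocab=21, seq_len=50, latent=32):
        super().__init__()
        self.seq_len, self.vocab = seq_len, vocab
        self.embed = nn.Embedding(vocab, 16)
        self.encoder = nn.Sequential(nn.Flatten(), nn.Linear(seq_len * 16, latent))
        self.decoder = nn.Linear(latent, seq_len * vocab)
        self.fitness_head = nn.Sequential(            # the paper regularizes this head;
            nn.Linear(latent, 32), nn.ReLU(),         # a plain MLP stands in here
            nn.Linear(32, 1))

    def forward(self, tokens):                        # tokens: (batch, seq_len) ints
        z = self.encoder(self.embed(tokens))
        logits = self.decoder(z).view(-1, self.seq_len, self.vocab)
        return logits, self.fitness_head(z), z

def optimize_latent(model, z0, steps=100, lr=0.1):
    """Gradient ascent on predicted fitness within latent space."""
    z = z0.clone().detach().requires_grad_(True)
    opt = torch.optim.Adam([z], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        (-model.fitness_head(z).sum()).backward()     # maximize predicted fitness
        opt.step()
    return z.detach()
```

Because the fitness head is differentiable in z, traversal of the fitness landscape happens entirely in the smooth latent space; decoding the optimized latent point then proposes a concrete sequence.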
Abstract: Biomolecular graph analysis has recently gained much attention in the emerging field of geometric deep learning. While numerous approaches aim to train classifiers that accurately predict molecular properties from graphs that encode their structure, an equally important task is to organize biomolecular graphs in ways that expose meaningful relations and variations between them. We propose a geometric scattering autoencoder (GSAE) network for learning such graph embeddings. Our embedding network first extracts rich graph features using the recently proposed geometric scattering transform. It then leverages a semi-supervised variational autoencoder to extract a low-dimensional embedding that retains the information in these features needed both to predict molecular properties and to characterize the graphs themselves. Our approach is based on the intuition that geometric scattering generates multi-resolution features with built-in invariance to deformations, but, being unsupervised, these features may not be tuned to optimally capture relevant domain-specific properties. We demonstrate the effectiveness of our approach for data exploration of RNA foldings. Like proteins, RNA molecules can fold into low-energy functional structures such as hairpins, but the landscape of possible folds and folding sequences is not well visualized by existing methods. We show that GSAE organizes RNA graphs both by structure and by energy, accurately reflecting bistable RNA structures. Furthermore, it enables interpolation of embedded molecule sequences, mimicking folding trajectories. Finally, using an auxiliary inverse-scattering model, we demonstrate our ability to generate synthetic RNA graphs along the trajectory, thus providing hypothetical folding sequences for further analysis.
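A minimal sketch of the embedding step, assuming PyTorch and precomputed scattering coefficients as input; the layer sizes, loss weights, and auxiliary energy-regression head are illustrative assumptions, not the published GSAE configuration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GSAESketch(nn.Module):
    """Semi-supervised VAE over precomputed scattering coefficients, with
    an auxiliary head regressing a molecular property (e.g., fold energy)
    so that the latent space is organized by that property."""
    def __init__(self, in_dim, latent=2):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(in_dim, 128), nn.ReLU())
        self.mu, self.logvar = nn.Linear(128, latent), nn.Linear(128, latent)
        self.dec = nn.Sequential(nn.Linear(latent, 128), nn.ReLU(),
                                 nn.Linear(128, in_dim))
        self.prop = nn.Linear(latent, 1)              # auxiliary property regressor

    def forward(self, s):                             # s: (batch, in_dim) scattering features
        h = self.enc(s)
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()  # reparameterization trick
        return self.dec(z), self.prop(z), mu, logvar

def gsae_loss(s, recon, y_pred, y, mu, logvar, beta=1e-3, alpha=1.0):
    rec = F.mse_loss(recon, s)                        # reconstruct scattering features
    kld = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    sup = F.mse_loss(y_pred, y)                       # semi-supervision on the property
    return rec + beta * kld + alpha * sup
```

Interpolating between two embedded structures and decoding the path through an inverse-scattering model (not shown) is what yields the hypothetical folding sequences described above.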