Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jim Brase

AMPL: A Data-Driven Modeling Pipeline for Drug Discovery

Nov 14, 2019

Amanda J. Minnich, Kevin McLoughlin, Margaret Tse, Jason Deng, Andrew Weber, Neha Murad, Benjamin D. Madej, Bharath Ramsundar, Tom Rush, Stacie Calad-Thomson(+2 more)

Figure 1 for AMPL: A Data-Driven Modeling Pipeline for Drug Discovery

Figure 2 for AMPL: A Data-Driven Modeling Pipeline for Drug Discovery

Figure 3 for AMPL: A Data-Driven Modeling Pipeline for Drug Discovery

Figure 4 for AMPL: A Data-Driven Modeling Pipeline for Drug Discovery

Abstract:One of the key requirements for incorporating machine learning into the drug discovery process is complete reproducibility and traceability of the model building and evaluation process. With this in mind, we have developed an end-to-end modular and extensible software pipeline for building and sharing machine learning models that predict key pharma-relevant parameters. The ATOM Modeling PipeLine, or AMPL, extends the functionality of the open source library DeepChem and supports an array of machine learning and molecular featurization tools. We have benchmarked AMPL on a large collection of pharmaceutical datasets covering a wide range of parameters. As a result of these comprehensive experiments, we have found that physicochemical descriptors and deep learning-based graph representations significantly outperform traditional fingerprints in the characterization of molecular features. We have also found that dataset size is directly correlated to prediction performance, and that single-task deep learning models only outperform shallow learners if there is sufficient data. Likewise, dataset size has a direct impact on model predictivity, independent of comprehensive hyperparameter model tuning. Our findings point to the need for public dataset integration or multi-task/transfer learning approaches. Lastly, we found that uncertainty quantification (UQ) analysis may help identify model error; however, efficacy of UQ to filter predictions varies considerably between datasets and featurization/model types. AMPL is open source and available for download at http://github.com/ATOMconsortium/AMPL.

Via

Access Paper or Ask Questions

Precision Medicine as an Accelerator for Next Generation Cognitive Supercomputing

Apr 29, 2018

Edmon Begoli, Jim Brase, Bambi DeLaRosa, Penelope Jones, Dimitri Kusnezov, Jason Paragas, Rick Stevens, Fred Streitz, Georgia Tourassi

Figure 1 for Precision Medicine as an Accelerator for Next Generation Cognitive Supercomputing

Figure 2 for Precision Medicine as an Accelerator for Next Generation Cognitive Supercomputing

Figure 3 for Precision Medicine as an Accelerator for Next Generation Cognitive Supercomputing

Figure 4 for Precision Medicine as an Accelerator for Next Generation Cognitive Supercomputing

Abstract:In the past several years, we have taken advantage of a number of opportunities to advance the intersection of next generation high-performance computing AI and big data technologies through partnerships in precision medicine. Today we are in the throes of piecing together what is likely the most unique convergence of medical data and computer technologies. But more deeply, we observe that the traditional paradigm of computer simulation and prediction needs fundamental revision. This is the time for a number of reasons. We will review what the drivers are, why now, how this has been approached over the past several years, and where we are heading.

* SUPERCOMPUTING FRONTIERS AND INNOVATIONS, 2018

Via

Access Paper or Ask Questions