Hongwei Jin

State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University

ICML Topological Deep Learning Challenge 2024: Beyond the Graph Domain

Sep 08, 2024

Guillermo Bernárdez, Lev Telyatnikov, Marco Montagna, Federica Baccini, Mathilde Papillon, Miquel Ferriol-Galmés, Mustafa Hajij, Theodore Papamarkou, Maria Sofia Bucarelli, Olga Zaghen (+63 more)

Abstract: This paper describes the 2nd edition of the ICML Topological Deep Learning Challenge, which was hosted within the ICML 2024 ELLIS Workshop on Geometry-grounded Representation Learning and Generative Modeling (GRaM). The challenge focused on the problem of representing data in different discrete topological domains in order to bridge the gap between Topological Deep Learning (TDL) and other types of structured datasets (e.g., point clouds, graphs). Specifically, participants were asked to design and implement topological liftings, i.e., mappings between different data structures and topological domains such as hypergraphs or simplicial, cell, and combinatorial complexes. The challenge received 52 submissions satisfying all the requirements. This paper introduces the main scope of the challenge and summarizes the main results and findings.

* Proceedings of the Geometry-grounded Representation Learning and Generative Modeling Workshop (GRaM) at ICML 2024
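
To make the idea of a topological lifting concrete, here is a minimal sketch of one well-known lifting, the clique complex, which maps a graph to a simplicial complex by turning every clique into a simplex. It is an illustration only and is not taken from any challenge submission; the function name `clique_lifting` and the `max_dim` parameter are assumptions made for this example.

```python
from itertools import combinations


def clique_lifting(edges, max_dim=2):
    """Lift a graph to a simplicial complex by adding every clique as a simplex.

    Hypothetical helper for illustration. Nodes are inferred from the edge
    list, so isolated nodes are not represented.
    """
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    nodes = sorted(adj)

    # simplices[k] holds the k-dimensional simplices (k + 1 vertices each).
    simplices = {0: [(n,) for n in nodes]}
    for dim in range(1, max_dim + 1):
        simplices[dim] = [
            combo
            for combo in combinations(nodes, dim + 1)
            # A vertex set is a simplex only if all its pairs are edges.
            if all(b in adj[a] for a, b in combinations(combo, 2))
        ]
    return simplices


# A triangle (0, 1, 2) with a pendant edge (2, 3).
edges = [(0, 1), (1, 2), (0, 2), (2, 3)]
complex_ = clique_lifting(edges)
print(complex_[2])  # the single 2-simplex filling the triangle
```

The brute-force enumeration over vertex subsets is only practical for small graphs; it is chosen here for readability, not efficiency.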

Large Language Models for Anomaly Detection in Computational Workflows: from Supervised Fine-Tuning to In-Context Learning

Jul 24, 2024

Hongwei Jin, George Papadimitriou, Krishnan Raghavan, Pawel Zuk, Prasanna Balaprakash, Cong Wang, Anirban Mandal, Ewa Deelman

Abstract: Anomaly detection in computational workflows is critical for ensuring system reliability and security. However, traditional rule-based methods struggle to detect novel anomalies. This paper leverages large language models (LLMs) for workflow anomaly detection by exploiting their ability to learn complex data patterns. Two approaches are investigated: 1) supervised fine-tuning (SFT), where pre-trained LLMs are fine-tuned on labeled data for sentence classification to identify anomalies, and 2) in-context learning (ICL), where prompts containing task descriptions and examples guide LLMs in few-shot anomaly detection without fine-tuning. The paper evaluates the performance, efficiency, and generalization of SFT models, and explores zero-shot and few-shot ICL prompts as well as interpretability enhancement via chain-of-thought prompting. Experiments across multiple workflow datasets demonstrate the promising potential of LLMs for effective anomaly detection in complex executions.

* 12 pages, 14 figures; accepted at SC'24; source code: https://github.com/PoSeiDon-Workflows/LLM_AD
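
The in-context learning setup described in the abstract (a task description plus labeled examples, followed by the record to classify) can be sketched as a simple prompt builder. This is a hedged illustration, not the authors' prompt format: the function name `build_icl_prompt`, the `Job:`/`Label:` layout, and the example feature strings are all assumptions made here.

```python
def build_icl_prompt(task_description, examples, query):
    """Assemble a few-shot prompt; with examples=[] it becomes zero-shot.

    Hypothetical layout: each example is a (text, label) pair rendered as
    a "Job:" line followed by a "Label:" line.
    """
    parts = [task_description.strip(), ""]
    for text, label in examples:
        parts.append(f"Job: {text}")
        parts.append(f"Label: {label}")
        parts.append("")
    # The final label is left blank for the model to complete.
    parts.append(f"Job: {query}")
    parts.append("Label:")
    return "\n".join(parts)


# Made-up job records, for illustration only.
prompt = build_icl_prompt(
    "Classify each workflow job record as normal or anomalous.",
    [
        ("runtime=12s cpu=0.4 queue_delay=1s", "normal"),
        ("runtime=950s cpu=0.1 queue_delay=300s", "anomalous"),
    ],
    "runtime=14s cpu=0.5 queue_delay=2s",
)
print(prompt)
```

Passing an empty example list gives the zero-shot variant, so the same builder covers both settings mentioned in the abstract.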

Physics-Informed Heterogeneous Graph Neural Networks for DC Blocker Placement

May 16, 2024

Hongwei Jin, Prasanna Balaprakash, Allen Zou, Pieter Ghysels, Aditi S. Krishnapriyan, Adam Mate, Arthur Barnes, Russell Bent

Abstract: The threat of geomagnetic disturbances (GMDs) to the reliable operation of the bulk energy system has spurred the development of effective strategies for mitigating their impacts. One such approach involves placing transformer neutral blocking devices, which interrupt the path of geomagnetically induced currents (GICs) to limit their impact. The high cost of these devices and the sparsity of transformers that experience high GICs during GMD events, however, call for a sparse placement strategy that involves high computational cost. To address this challenge, we developed a physics-informed heterogeneous graph neural network (PIHGNN) for solving the graph-based dc-blocker placement problem. Our approach combines a heterogeneous graph neural network (HGNN) with a physics-informed neural network (PINN) to capture the diverse types of nodes and edges in ac/dc networks and to incorporate the physical laws of the power grid. We train the PIHGNN model using a surrogate power flow model and validate it using case studies. Results demonstrate that PIHGNN can effectively and efficiently support the deployment of GIC dc-current blockers, ensuring the continued supply of electricity to meet societal demands. Our approach has the potential to contribute to the development of more reliable and resilient power grids capable of withstanding the growing threat that GMDs pose.

* Accepted at PSCC 2024

Latent Chemical Space Searching for Plug-in Multi-objective Molecule Generation

Apr 10, 2024

Ningfeng Liu, Jie Yu, Siyu Xiu, Xinfang Zhao, Siyu Lin, Bo Qiang, Ruqiu Zheng, Hongwei Jin, Liangren Zhang, Zhenming Liu

Abstract: Molecular generation, an essential method for identifying new drug structures, has been supported by advancements in machine learning and computational technology. However, challenges remain in multi-objective generation, model adaptability, and practical application in drug discovery. In this study, we developed a versatile 'plug-in' molecular generation model that incorporates multiple objectives related to target affinity, drug-likeness, and synthesizability, facilitating its application in various drug development contexts. We improved Particle Swarm Optimization (PSO) for the drug discovery setting and, through comparative experiments, identified PSO-ENP as the optimal variant for multi-objective molecular generation and optimization. The model also incorporates a novel target-ligand affinity predictor, enhancing the model's utility by supporting three-dimensional information and improving synthetic feasibility. Case studies focused on generating and optimizing drug-like large marine natural products were performed, underscoring PSO-ENP's effectiveness and demonstrating its considerable potential for practical drug discovery applications.
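
For readers unfamiliar with Particle Swarm Optimization, the following is a minimal sketch of the textbook algorithm on a toy objective. It is not PSO-ENP and not the paper's model: in the paper the search runs over a latent chemical space with a multi-objective score, whereas here a simple sphere function stands in for the objective. The function name `pso_minimize` and all hyperparameter values are assumptions chosen for illustration.

```python
import random


def pso_minimize(objective, dim, n_particles=20, n_iters=100,
                 inertia=0.7, c1=1.5, c2=1.5, bounds=(-5.0, 5.0), seed=0):
    """Plain global-best PSO (illustrative defaults, not tuned values)."""
    rng = random.Random(seed)
    lo, hi = bounds
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]              # per-particle best positions
    best_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=best_val.__getitem__)
    g_pos, g_val = best_pos[g][:], best_val[g]  # swarm-wide best

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity: inertia + pull to own best + pull to swarm best.
                vel[i][d] = (inertia * vel[i][d]
                             + c1 * r1 * (best_pos[i][d] - pos[i][d])
                             + c2 * r2 * (g_pos[d] - pos[i][d]))
                # Move and clamp to the search box.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < best_val[i]:
                best_pos[i], best_val[i] = pos[i][:], val
                if val < g_val:
                    g_pos, g_val = pos[i][:], val
    return g_pos, g_val


def sphere(x):
    """Toy objective with its minimum of 0 at the origin."""
    return sum(v * v for v in x)


best_x, best_f = pso_minimize(sphere, dim=3)
print(best_x, best_f)
```

A multi-objective variant would typically replace `objective` with a weighted or Pareto-based score; how PSO-ENP does this is not described in the abstract and is not reproduced here.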

Self-supervised Learning for Anomaly Detection in Computational Workflows

Oct 02, 2023

Hongwei Jin, Krishnan Raghavan, George Papadimitriou, Cong Wang, Anirban Mandal, Ewa Deelman, Prasanna Balaprakash

Abstract: Anomaly detection is the task of identifying abnormal behavior of a system. Anomaly detection in computational workflows is of special interest because of its wide implications in various domains such as cybersecurity, finance, and social networks. However, anomaly detection in computational workflows (often modeled as graphs) is a relatively unexplored problem and poses distinct challenges. For instance, when anomaly detection is performed on graph data, the complex interdependency of nodes and edges and the heterogeneity of node attributes and edge types must be accounted for. Although the use of graph neural networks can help capture complex interdependencies, the scarcity of labeled anomalous examples from workflow executions is still a significant challenge. To address this problem, we introduce an autoencoder-driven self-supervised learning (SSL) approach that learns a summary statistic from unlabeled workflow data and estimates the normal behavior of the computational workflow in the latent space. In this approach, we combine generative and contrastive learning objectives to detect outliers in the summary statistics. We demonstrate that by estimating the distribution of normal behavior in the latent space, we can outperform state-of-the-art anomaly detection methods on our benchmark datasets.

Orthogonal Gromov-Wasserstein Discrepancy with Efficient Lower Bound

May 12, 2022

Hongwei Jin, Zishun Yu, Xinhua Zhang

Abstract: Comparing structured data from possibly different metric-measure spaces is a fundamental task in machine learning, with applications in, e.g., graph classification. The Gromov-Wasserstein (GW) discrepancy formulates a coupling between the structured data based on optimal transportation, tackling the incomparability between different structures by aligning the intra-relational geometries. Although efficient local solvers such as conditional gradient and Sinkhorn are available, the inherent non-convexity still prevents a tractable evaluation, and the existing lower bounds are not tight enough for practical use. To address this issue, we take inspiration from the connection with the quadratic assignment problem, and propose the orthogonal Gromov-Wasserstein (OGW) discrepancy as a surrogate of GW. It admits an efficient and closed-form lower bound with the complexity of $\mathcal{O}(n^3)$, and directly extends to the fused Gromov-Wasserstein (FGW) distance, incorporating node features into the coupling. Extensive experiments on both synthetic and real-world datasets show the tightness of our lower bounds, and both OGW and its lower bounds efficiently deliver accurate predictions and satisfactory barycenters for graph sets.
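
As background for the GW discrepancy mentioned above, the sketch below evaluates the standard squared-loss GW objective $\sum_{i,j,k,l} (C^1_{ik} - C^2_{jl})^2 \, T_{ij} T_{kl}$ for a fixed coupling $T$. It does not solve the non-convex minimization over couplings and does not implement OGW or its lower bound; the function name `gw_objective` and the toy matrices are assumptions for illustration.

```python
def gw_objective(C1, C2, T):
    """Squared-loss GW objective for a given coupling T.

    C1 (n x n) and C2 (m x m) hold pairwise distances within each space;
    T (n x m) is a coupling between the two sets of points.
    """
    n, m = len(C1), len(C2)
    total = 0.0
    for i in range(n):
        for j in range(m):
            if T[i][j] == 0.0:
                continue  # zero-mass pairs contribute nothing
            for k in range(n):
                for l in range(m):
                    total += (C1[i][k] - C2[j][l]) ** 2 * T[i][j] * T[k][l]
    return total


# Shortest-path distances on a 3-node path graph, compared with itself.
C = [[0, 1, 2],
     [1, 0, 1],
     [2, 1, 0]]
identity = [[1 / 3 if i == j else 0.0 for j in range(3)] for i in range(3)]
uniform = [[1 / 9] * 3 for _ in range(3)]

print(gw_objective(C, C, identity))  # matching each node to itself costs 0
print(gw_objective(C, C, uniform))   # an uninformative coupling costs more
```

The four nested loops make the naive evaluation quartic in the number of nodes, which is why practical solvers restructure the computation.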

Gromov-Wasserstein Discrepancy with Local Differential Privacy for Distributed Structural Graphs

Feb 01, 2022

Hongwei Jin, Xun Chen

Abstract: Learning the similarity between structured data, especially graphs, is an essential problem. Alongside approaches such as graph kernels, the Gromov-Wasserstein (GW) distance has recently drawn considerable attention for its flexibility in capturing both topological and feature characteristics and for its handling of permutation invariance. However, structured data are widely distributed across different data mining and machine learning applications. Because of privacy concerns, access to decentralized data is limited to individual clients or separate silos. To tackle these issues, we propose a privacy-preserving framework that analyzes the GW discrepancy of node embeddings learned locally from graph neural networks in a federated fashion, and then explicitly applies local differential privacy (LDP) based on a Multi-bit Encoder to protect sensitive information. Our experiments show that, with the strong privacy protection guaranteed by the $\varepsilon$-LDP algorithm, the proposed framework not only preserves privacy in graph learning but also yields a noised structural metric under the GW distance, resulting in comparable and even better performance in classification and clustering tasks. Moreover, we explain the rationale behind the LDP-based GW distance analytically and empirically.
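
For reference, the standard definition of $\varepsilon$-local differential privacy is given below. It is stated here as general background and is not quoted from the paper; the symbols $\mathcal{M}$, $x$, $x'$, and $S$ are introduced only for this definition. A randomized mechanism $\mathcal{M}$ satisfies $\varepsilon$-LDP if, for every pair of possible inputs $x, x'$ held by a client and every set of outputs $S$,

```latex
\Pr[\mathcal{M}(x) \in S] \;\le\; e^{\varepsilon} \, \Pr[\mathcal{M}(x') \in S].
```

A smaller $\varepsilon$ means the released output reveals less about which input produced it, at the cost of more noise.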

TF3P: Three-dimensional Force Fields Fingerprint Learned by Deep Capsular Network

Dec 25, 2019

Yanxing Wang, Jianxing Hu, Junyong Lai, Yibo Li, Hongwei Jin, Lihe Zhang, Liangren Zhang, Zhenming Liu

Abstract: Molecular fingerprints are the workhorse of ligand-based drug discovery. In recent years, an increasing number of research papers have reported fascinating results on using deep neural networks to learn 2D molecular representations as fingerprints. One may anticipate that the integration of deep learning would also benefit 3D fingerprints. Here, we present a new 3D small-molecule fingerprint, the three-dimensional force fields fingerprint (TF3P), learned by a deep capsular network whose training does not require a labeled dataset for specific predictive tasks. TF3P encodes the 3D force field information of molecules and demonstrates a stronger ability to capture 3D structural changes, to recognize molecules that are alike in 3D but not in 2D, and to recognize similar targets inaccessible to other fingerprints, including the sole existing 3D fingerprint, E3FP, based on ligand similarity alone. Furthermore, TF3P is compatible with both statistical models (e.g., similarity ensemble approach) and machine learning models. Altogether, we report TF3P as a new 3D small-molecule fingerprint with a promising future in ligand-based drug discovery.
