Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yinhua Piao

GraphT5: Unified Molecular Graph-Language Modeling via Multi-Modal Cross-Token Attention

Mar 07, 2025

Sangyeup Kim, Nayeon Kim, Yinhua Piao, Sun Kim

Figure 1 for GraphT5: Unified Molecular Graph-Language Modeling via Multi-Modal Cross-Token Attention

Figure 2 for GraphT5: Unified Molecular Graph-Language Modeling via Multi-Modal Cross-Token Attention

Figure 3 for GraphT5: Unified Molecular Graph-Language Modeling via Multi-Modal Cross-Token Attention

Figure 4 for GraphT5: Unified Molecular Graph-Language Modeling via Multi-Modal Cross-Token Attention

Abstract:Molecular language modeling tasks such as molecule captioning have been recognized for their potential to further understand molecular properties that can aid drug discovery or material synthesis based on chemical reactions. Unlike the common use of molecule graphs in predicting molecular properties, most methods in molecular language modeling rely heavily on SMILES sequences. This preference is because the task involves generating a sequence of multiple tokens using transformer-based models. Therefore, a main challenge is determining how to integrate graph data, which contains structural and spatial information about molecules, with text data. In addition, simply using both 1D SMILES text and 2D graph as inputs without addressing how they align and represent the molecule structure in different modalities makes it challenging to fully utilize structural knowledge about molecules. To this end, we propose GraphT5, a multi-modal framework that integrates 1D SMILES text and 2D graph representations of molecules for molecular language modeling. Specifically, we introduce a novel cross-token attention module in GraphT5 to bridge the gap arising from the fundamental differences between the two modalities of molecule representations. Cross-token attention exploits implicit information between SMILES and graphs of molecules, resulting from their interactions at a fine-grained token level that benefits molecular language modeling. Extensive experiments including molecule captioning, IUPAC name prediction tasks, and case studies show that our GraphT5 outperforms the latest baseline approaches, which validates the effectiveness of our GraphT5 in sufficiently utilizing 1D SMILES text and 2D graph representations.

Via

Access Paper or Ask Questions

Improving out-of-distribution generalization in graphs via hierarchical semantic environments

Mar 04, 2024

Yinhua Piao, Sangseon Lee, Yijingxiu Lu, Sun Kim

Figure 1 for Improving out-of-distribution generalization in graphs via hierarchical semantic environments

Figure 2 for Improving out-of-distribution generalization in graphs via hierarchical semantic environments

Figure 3 for Improving out-of-distribution generalization in graphs via hierarchical semantic environments

Figure 4 for Improving out-of-distribution generalization in graphs via hierarchical semantic environments

Abstract:Out-of-distribution (OOD) generalization in the graph domain is challenging due to complex distribution shifts and a lack of environmental contexts. Recent methods attempt to enhance graph OOD generalization by generating flat environments. However, such flat environments come with inherent limitations to capture more complex data distributions. Considering the DrugOOD dataset, which contains diverse training environments (e.g., scaffold, size, etc.), flat contexts cannot sufficiently address its high heterogeneity. Thus, a new challenge is posed to generate more semantically enriched environments to enhance graph invariant learning for handling distribution shifts. In this paper, we propose a novel approach to generate hierarchical semantic environments for each graph. Firstly, given an input graph, we explicitly extract variant subgraphs from the input graph to generate proxy predictions on local environments. Then, stochastic attention mechanisms are employed to re-extract the subgraphs for regenerating global environments in a hierarchical manner. In addition, we introduce a new learning objective that guides our model to learn the diversity of environments within the same hierarchy while maintaining consistency across different hierarchies. This approach enables our model to consider the relationships between environments and facilitates robust graph invariant learning. Extensive experiments on real-world graph data have demonstrated the effectiveness of our framework. Particularly, in the challenging dataset DrugOOD, our method achieves up to 1.29\% and 2.83\% improvement over the best baselines on IC50 and EC50 prediction tasks, respectively.

* Accepted by CVPR 2024

Via

Access Paper or Ask Questions

Clinical Note Owns its Hierarchy: Multi-Level Hypergraph Neural Networks for Patient-Level Representation Learning

May 16, 2023

Nayeon Kim, Yinhua Piao, Sun Kim

Figure 1 for Clinical Note Owns its Hierarchy: Multi-Level Hypergraph Neural Networks for Patient-Level Representation Learning

Figure 2 for Clinical Note Owns its Hierarchy: Multi-Level Hypergraph Neural Networks for Patient-Level Representation Learning

Figure 3 for Clinical Note Owns its Hierarchy: Multi-Level Hypergraph Neural Networks for Patient-Level Representation Learning

Figure 4 for Clinical Note Owns its Hierarchy: Multi-Level Hypergraph Neural Networks for Patient-Level Representation Learning

Abstract:Leveraging knowledge from electronic health records (EHRs) to predict a patient's condition is essential to the effective delivery of appropriate care. Clinical notes of patient EHRs contain valuable information from healthcare professionals, but have been underused due to their difficult contents and complex hierarchies. Recently, hypergraph-based methods have been proposed for document classifications. Directly adopting existing hypergraph methods on clinical notes cannot sufficiently utilize the hierarchy information of the patient, which can degrade clinical semantic information by (1) frequent neutral words and (2) hierarchies with imbalanced distribution. Thus, we propose a taxonomy-aware multi-level hypergraph neural network (TM-HGNN), where multi-level hypergraphs assemble useful neutral words with rare keywords via note and taxonomy level hyperedges to retain the clinical semantic information. The constructed patient hypergraphs are fed into hierarchical message passing layers for learning more balanced multi-level knowledge at the note and taxonomy levels. We validate the effectiveness of TM-HGNN by conducting extensive experiments with MIMIC-III dataset on benchmark in-hospital-mortality prediction.

* ACL 2023 Main Conference

Via

Access Paper or Ask Questions

SPGP: Structure Prototype Guided Graph Pooling

Sep 16, 2022

Sangseon Lee, Dohoon Lee, Yinhua Piao, Sun Kim

Figure 1 for SPGP: Structure Prototype Guided Graph Pooling

Figure 2 for SPGP: Structure Prototype Guided Graph Pooling

Figure 3 for SPGP: Structure Prototype Guided Graph Pooling

Figure 4 for SPGP: Structure Prototype Guided Graph Pooling

Abstract:While graph neural networks (GNNs) have been successful for node classification tasks and link prediction tasks in graph, learning graph-level representations still remains a challenge. For the graph-level representation, it is important to learn both representation of neighboring nodes, i.e., aggregation, and graph structural information. A number of graph pooling methods have been developed for this goal. However, most of the existing pooling methods utilize k-hop neighborhood without considering explicit structural information in a graph. In this paper, we propose Structure Prototype Guided Pooling (SPGP) that utilizes prior graph structures to overcome the limitation. SPGP formulates graph structures as learnable prototype vectors and computes the affinity between nodes and prototype vectors. This leads to a novel node scoring scheme that prioritizes informative nodes while encapsulating the useful structures of the graph. Our experimental results show that SPGP outperforms state-of-the-art graph pooling methods on graph classification benchmark datasets in both accuracy and scalability.

* 18 pages, 6 figures

Via

Access Paper or Ask Questions

Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification

Dec 13, 2021

Yinhua Piao, Sangseon Lee, Dohoon Lee, Sun Kim

Figure 1 for Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification

Figure 2 for Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification

Figure 3 for Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification

Figure 4 for Sparse Structure Learning via Graph Neural Networks for Inductive Document Classification

Abstract:Recently, graph neural networks (GNNs) have been widely used for document classification. However, most existing methods are based on static word co-occurrence graphs without sentence-level information, which poses three challenges:(1) word ambiguity, (2) word synonymity, and (3) dynamic contextual dependency. To address these challenges, we propose a novel GNN-based sparse structure learning model for inductive document classification. Specifically, a document-level graph is initially generated by a disjoint union of sentence-level word co-occurrence graphs. Our model collects a set of trainable edges connecting disjoint words between sentences and employs structure learning to sparsely select edges with dynamic contextual dependencies. Graphs with sparse structures can jointly exploit local and global contextual information in documents through GNNs. For inductive learning, the refined document graph is further fed into a general readout function for graph-level classification and optimization in an end-to-end manner. Extensive experiments on several real-world datasets demonstrate that the proposed model outperforms most state-of-the-art results, and reveal the necessity to learn sparse structures for each document.

* Accepted by AAAI 2022

Via

Access Paper or Ask Questions