Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tianren Zhang

Multimodal machine learning with large language embedding model for polymer property prediction

Mar 29, 2025

Tianren Zhang, Dai-Bei Yang

Abstract:Contemporary large language models (LLMs), such as GPT-4 and Llama, have harnessed extensive computational power and diverse text corpora to achieve remarkable proficiency in interpreting and generating domain-specific content, including materials science. To leverage the domain knowledge embedded within these models, we propose a simple yet effective multimodal architecture, PolyLLMem, which integrates text embeddings generated by Llama 3 with molecular structure embeddings derived from Uni-Mol, for polymer properties prediction tasks. In our model, Low-rank adaptation (LoRA) layers were also incorporated during the property prediction tasks to refine the embeddings based on our limited polymer dataset, thereby enhancing their chemical relevance for polymer SMILES representation. This balanced fusion of fine-tuned textual and structural information enables PolyLLMem to accurately predict a variety of polymer properties despite the scarcity of training data. Its performance is comparable to, and in some cases exceeds, that of graph-based models, as well as transformer-based models that typically require pretraining on millions of polymer samples. These findings demonstrate that LLM, such as Llama, can effectively capture chemical information encoded in polymer PSMILES, and underscore the efficacy of multimodal fusion of LLM embeddings and molecular structure embeddings in overcoming data scarcity and accelerating the discovery of advanced polymeric materials.

Via

Access Paper or Ask Questions

Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them

Mar 20, 2025

Guanyu Chen, Peiyang Wang, Tianren Zhang, Feng Chen

Abstract:Large language models (LLMs) and Vision language models (VLMs) have been able to perform various forms of reasoning tasks in a wide range of scenarios, but are they truly engaging in task abstraction and rule-based reasoning beyond mere memorization and pattern matching? To answer this question, we propose a novel experimental approach, Misleading Fine-Tuning (MisFT), to examine whether LLMs/VLMs perform abstract reasoning by altering their original understanding of fundamental rules. In particular, by constructing a dataset with math expressions that contradict correct operation principles, we fine-tune the model to learn those contradictory rules and assess its generalization ability on different test domains. Through a series of experiments, we find that current LLMs/VLMs are capable of effectively applying contradictory rules to solve practical math word problems and math expressions represented by images, implying the presence of an internal mechanism that abstracts before reasoning.

Via

Access Paper or Ask Questions

When do neural networks learn world models?

Feb 13, 2025

Tianren Zhang, Guanyu Chen, Feng Chen

Abstract:Humans develop world models that capture the underlying generation process of data. Whether neural networks can learn similar world models remains an open problem. In this work, we provide the first theoretical results for this problem, showing that in a multi-task setting, models with a low-degree bias provably recover latent data-generating variables under mild assumptions -- even if proxy tasks involve complex, non-linear functions of the latents. However, such recovery is also sensitive to model architecture. Our analysis leverages Boolean models of task solutions via the Fourier-Walsh transform and introduces new techniques for analyzing invertible Boolean transforms, which may be of independent interest. We illustrate the algorithmic implications of our results and connect them to related research areas, including self-supervised learning, out-of-distribution generalization, and the linear representation hypothesis in large language models.

* 28 pages, 9 figures

Via

Access Paper or Ask Questions

Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to Generalize

Jun 06, 2024

Tianren Zhang, Chujie Zhao, Guanyu Chen, Yizhou Jiang, Feng Chen

Figure 1 for Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to Generalize

Figure 2 for Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to Generalize

Figure 3 for Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to Generalize

Figure 4 for Feature Contamination: Neural Networks Learn Uncorrelated Features and Fail to Generalize

Abstract:Learning representations that generalize under distribution shifts is critical for building robust machine learning models. However, despite significant efforts in recent years, algorithmic advances in this direction have been limited. In this work, we seek to understand the fundamental difficulty of out-of-distribution generalization with deep neural networks. We first empirically show that perhaps surprisingly, even allowing a neural network to explicitly fit the representations obtained from a teacher network that can generalize out-of-distribution is insufficient for the generalization of the student network. Then, by a theoretical study of two-layer ReLU networks optimized by stochastic gradient descent (SGD) under a structured feature model, we identify a fundamental yet unexplored feature learning proclivity of neural networks, feature contamination: neural networks can learn uncorrelated features together with predictive features, resulting in generalization failure under distribution shifts. Notably, this mechanism essentially differs from the prevailing narrative in the literature that attributes the generalization failure to spurious correlations. Overall, our results offer new insights into the non-linear feature learning dynamics of neural networks and highlight the necessity of considering inductive biases in out-of-distribution generalization.

* ICML 2024

Via

Access Paper or Ask Questions

Preserving Silent Features for Domain Generalization

Jan 06, 2024

Chujie Zhao, Tianren Zhang, Feng Chen

Abstract:Domain generalization (DG) aims to improve the generalization ability of the model trained on several known training domains over unseen test domains. Previous work has shown that self-supervised contrastive pre-training improves the robustness of the model on downstream tasks. However, in this paper, we find that self-supervised models do not exhibit better generalization performance than supervised models pre-trained on the same dataset in the DG setting. We argue that this is owing to the fact that the richer intra-class discriminative features extracted by self-supervised contrastive learning, which we term silent features, are suppressed during supervised fine-tuning. These silent features are likely to contain features that are more generalizable on the test domain. In this work, we model and analyze this feature suppression phenomenon and theoretically prove that preserving silent features can achieve lower expected test domain risk under certain conditions. In light of this, we propose a simple yet effective method termed STEP (Silent Feature Preservation) to improve the generalization performance of the self-supervised contrastive learning pre-trained model by alleviating the suppression of silent features during the supervised fine-tuning process. Experimental results show that STEP exhibits state-of-the-art performance on standard DG benchmarks with significant distribution shifts.

Via

Access Paper or Ask Questions

Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Jan 26, 2022

Yihan Li, Jinsheng Ren, Tianrun Xu, Tianren Zhang, Haichuan Gao, Feng Chen

Figure 1 for Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Figure 2 for Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Figure 3 for Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Figure 4 for Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Abstract:Recently, incorporating natural language instructions into reinforcement learning (RL) to learn semantically meaningful representations and foster generalization has caught many concerns. However, the semantical information in language instructions is usually entangled with task-specific state information, which hampers the learning of semantically invariant and reusable representations. In this paper, we propose a method to learn such representations called element randomization, which extracts task-relevant but environment-agnostic semantics from instructions using a set of environments with randomized elements, e.g., topological structures or textures, yet the same language instruction. We theoretically prove the feasibility of learning semantically invariant representations through randomization. In practice, we accordingly develop a hierarchy of policies, where a high-level policy is designed to modulate the behavior of a goal-conditioned low-level policy by proposing subgoals as semantically invariant representations. Experiments on challenging long-horizon tasks show that (1) our low-level policy reliably generalizes to tasks against environment changes; (2) our hierarchical policy exhibits extensible generalization in unseen new tasks that can be decomposed into several solvable sub-tasks; and (3) by storing and replaying language trajectories as succinct policy representations, the agent can complete tasks in a one-shot fashion, i.e., once one successful trajectory has been attained.

Via

Access Paper or Ask Questions

Adjacency constraint for efficient hierarchical reinforcement learning

Oct 30, 2021

Tianren Zhang, Shangqi Guo, Tian Tan, Xiaolin Hu, Feng Chen

Figure 1 for Adjacency constraint for efficient hierarchical reinforcement learning

Figure 2 for Adjacency constraint for efficient hierarchical reinforcement learning

Figure 3 for Adjacency constraint for efficient hierarchical reinforcement learning

Figure 4 for Adjacency constraint for efficient hierarchical reinforcement learning

Abstract:Goal-conditioned Hierarchical Reinforcement Learning (HRL) is a promising approach for scaling up reinforcement learning (RL) techniques. However, it often suffers from training inefficiency as the action space of the high-level, i.e., the goal space, is large. Searching in a large goal space poses difficulty for both high-level subgoal generation and low-level policy learning. In this paper, we show that this problem can be effectively alleviated by restricting the high-level action space from the whole goal space to a $k$-step adjacent region of the current state using an adjacency constraint. We theoretically prove that in a deterministic Markov Decision Process (MDP), the proposed adjacency constraint preserves the optimal hierarchical policy, while in a stochastic MDP the adjacency constraint induces a bounded state-value suboptimality determined by the MDP's transition structure. We further show that this constraint can be practically implemented by training an adjacency network that can discriminate between adjacent and non-adjacent subgoals. Experimental results on discrete and continuous control tasks including challenging simulated robot locomotion and manipulation tasks show that incorporating the adjacency constraint significantly boosts the performance of state-of-the-art goal-conditioned HRL approaches.

* arXiv admin note: substantial text overlap with arXiv:2006.11485

Via

Access Paper or Ask Questions

Subjective Learning for Open-Ended Data

Aug 27, 2021

Tianren Zhang, Yizhou Jiang, Xin Su, Shangqi Guo, Feng Chen

Figure 1 for Subjective Learning for Open-Ended Data

Figure 2 for Subjective Learning for Open-Ended Data

Figure 3 for Subjective Learning for Open-Ended Data

Figure 4 for Subjective Learning for Open-Ended Data

Abstract:Conventional machine learning methods typically assume that data is split according to tasks, and the data in each task can be modeled by a single target function. However, this assumption is invalid in open-ended environments where no manual task definition is available. In this paper, we present a novel supervised learning paradigm of learning from open-ended data. Open-ended data inherently requires multiple single-valued deterministic mapping functions to capture all its input-output relations, exhibiting an essential structural difference from conventional supervised data. We formally expound this structural property with a novel concept termed as mapping rank, and show that open-ended data poses a fundamental difficulty for conventional supervised learning, since different data samples may conflict with each other if the mapping rank of data is larger than one. To address this issue, we devise an Open-ended Supervised Learning (OSL) framework, of which the key innovation is a subjective function that automatically allocates the data among multiple candidate models to resolve the conflict, developing a natural cognition hierarchy. We demonstrate the efficacy of OSL both theoretically and empirically, and show that OSL achieves human-like task cognition without task-level supervision.

Via

Access Paper or Ask Questions

CRIL: Continual Robot Imitation Learning via Generative and Prediction Model

Jul 02, 2021

Chongkai Gao, Haichuan Gao, Shangqi Guo, Tianren Zhang, Feng Chen

Figure 1 for CRIL: Continual Robot Imitation Learning via Generative and Prediction Model

Figure 2 for CRIL: Continual Robot Imitation Learning via Generative and Prediction Model

Figure 3 for CRIL: Continual Robot Imitation Learning via Generative and Prediction Model

Figure 4 for CRIL: Continual Robot Imitation Learning via Generative and Prediction Model

Abstract:Imitation learning (IL) algorithms have shown promising results for robots to learn skills from expert demonstrations. However, they need multi-task demonstrations to be provided at once for acquiring diverse skills, which is difficult in real world. In this work we study how to realize continual imitation learning ability that empowers robots to continually learn new tasks one by one, thus reducing the burden of multi-task IL and accelerating the process of new task learning at the same time. We propose a novel trajectory generation model that employs both a generative adversarial network and a dynamics-aware prediction model to generate pseudo trajectories from all learned tasks in the new task learning process. Our experiments on both simulation and real-world manipulation tasks demonstrate the effectiveness of our method.

Via

Access Paper or Ask Questions

Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning

Jun 20, 2020

Tianren Zhang, Shangqi Guo, Tian Tan, Xiaolin Hu, Feng Chen

Figure 1 for Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning

Figure 2 for Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning

Figure 3 for Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning

Figure 4 for Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning

Abstract:Goal-conditioned hierarchical reinforcement learning (HRL) is a promising approach for scaling up reinforcement learning (RL) techniques. However, it often suffers from training inefficiency as the action space of the high-level, i.e., the goal space, is often large. Searching in a large goal space poses difficulties for both high-level subgoal generation and low-level policy learning. In this paper, we show that this problem can be effectively alleviated by restricting the high-level action space from the whole goal space to a $k$-step adjacency region centered by the current state using an adjacency constraint. We theoretically prove that the proposed adjacency constraint preserves the optimal hierarchical policy, and show that this constraint can be practically implemented by training an adjacency network that can discriminate between adjacent and non-adjacent subgoals. Experimental results on discrete and continuous control tasks show that our method outperforms the state-of-the-art HRL approaches.

Via

Access Paper or Ask Questions