Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhun Yang

LPMLN, Weak Constraints, and P-log

Jun 15, 2025

Joohyung Lee, Zhun Yang

Abstract:LPMLN is a recently introduced formalism that extends answer set programs by adopting the log-linear weight scheme of Markov Logic. This paper investigates the relationships between LPMLN and two other extensions of answer set programs: weak constraints to express a quantitative preference among answer sets, and P-log to incorporate probabilistic uncertainty. We present a translation of LPMLN into programs with weak constraints and a translation of P-log into LPMLN, which complement the existing translations in the opposite directions. The first translation allows us to compute the most probable stable models (i.e., MAP estimates) of LPMLN programs using standard ASP solvers. This result can be extended to other formalisms, such as Markov Logic, ProbLog, and Pearl's Causal Models, that are shown to be translatable into LPMLN. The second translation tells us how probabilistic nonmonotonicity (the ability of the reasoner to change his probabilistic model as a result of new information) of P-log can be represented in LPMLN, which yields a way to compute P-log using standard ASP solvers and MLN solvers.

* In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI 2017), pages 1170-1177, 2017

Via

Access Paper or Ask Questions

Think before You Simulate: Symbolic Reasoning to Orchestrate Neural Computation for Counterfactual Question Answering

Jun 12, 2025

Adam Ishay, Zhun Yang, Joohyung Lee, Ilgu Kang, Dongjae Lim

Abstract:Causal and temporal reasoning about video dynamics is a challenging problem. While neuro-symbolic models that combine symbolic reasoning with neural-based perception and prediction have shown promise, they exhibit limitations, especially in answering counterfactual questions. This paper introduces a method to enhance a neuro-symbolic model for counterfactual reasoning, leveraging symbolic reasoning about causal relations among events. We define the notion of a causal graph to represent such relations and use Answer Set Programming (ASP), a declarative logic programming method, to find how to coordinate perception and simulation modules. We validate the effectiveness of our approach on two benchmarks, CLEVRER and CRAFT. Our enhancement achieves state-of-the-art performance on the CLEVRER challenge, significantly outperforming existing models. In the case of the CRAFT benchmark, we leverage a large pre-trained language model, such as GPT-3.5 and GPT-4, as a proxy for a dynamics simulator. Our findings show that this method can further improve its performance on counterfactual questions by providing alternative prompts instructed by symbolic causal reasoning.

* In Proceedings the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

Via

Access Paper or Ask Questions

Towards a Personal Health Large Language Model

Jun 10, 2024

Justin Cosentino, Anastasiya Belyaeva, Xin Liu, Nicholas A. Furlotte, Zhun Yang, Chace Lee, Erik Schenck, Yojan Patel, Jian Cui, Logan Douglas Schneider(+24 more)

Figure 1 for Towards a Personal Health Large Language Model

Figure 2 for Towards a Personal Health Large Language Model

Figure 3 for Towards a Personal Health Large Language Model

Figure 4 for Towards a Personal Health Large Language Model

Abstract:In health, most large language model (LLM) research has focused on clinical tasks. However, mobile and wearable devices, which are rarely integrated into such tasks, provide rich, longitudinal data for personal health monitoring. Here we present Personal Health Large Language Model (PH-LLM), fine-tuned from Gemini for understanding and reasoning over numerical time-series personal health data. We created and curated three datasets that test 1) production of personalized insights and recommendations from sleep patterns, physical activity, and physiological responses, 2) expert domain knowledge, and 3) prediction of self-reported sleep outcomes. For the first task we designed 857 case studies in collaboration with domain experts to assess real-world scenarios in sleep and fitness. Through comprehensive evaluation of domain-specific rubrics, we observed that Gemini Ultra 1.0 and PH-LLM are not statistically different from expert performance in fitness and, while experts remain superior for sleep, fine-tuning PH-LLM provided significant improvements in using relevant domain knowledge and personalizing information for sleep insights. We evaluated PH-LLM domain knowledge using multiple choice sleep medicine and fitness examinations. PH-LLM achieved 79% on sleep and 88% on fitness, exceeding average scores from a sample of human experts. Finally, we trained PH-LLM to predict self-reported sleep quality outcomes from textual and multimodal encoding representations of wearable data, and demonstrate that multimodal encoding is required to match performance of specialized discriminative models. Although further development and evaluation are necessary in the safety-critical personal health domain, these results demonstrate both the broad knowledge and capabilities of Gemini models and the benefit of contextualizing physiological data for personal health applications as done with PH-LLM.

* 72 pages

Via

Access Paper or Ask Questions

Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text

Jul 15, 2023

Zhun Yang, Adam Ishay, Joohyung Lee

Figure 1 for Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text

Figure 2 for Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text

Figure 3 for Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text

Figure 4 for Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text

Abstract:While large language models (LLMs), such as GPT-3, appear to be robust and general, their reasoning ability is not at a level to compete with the best models trained for specific natural language reasoning problems. In this study, we observe that a large language model can serve as a highly effective few-shot semantic parser. It can convert natural language sentences into a logical form that serves as input for answer set programs, a logic-based declarative knowledge representation formalism. The combination results in a robust and general system that can handle multiple question-answering tasks without requiring retraining for each new task. It only needs a few examples to guide the LLM's adaptation to a specific task, along with reusable ASP knowledge modules that can be applied to multiple tasks. We demonstrate that this method achieves state-of-the-art performance on several NLP benchmarks, including bAbI, StepGame, CLUTRR, and gSCAN. Additionally, it successfully tackles robot planning tasks that an LLM alone fails to solve.

* 32 pages, Findings of the Association for Computational Linguistics: ACL 2023, 5186-5219

Via

Access Paper or Ask Questions

Leveraging Large Language Models to Generate Answer Set Programs

Jul 15, 2023

Adam Ishay, Zhun Yang, Joohyung Lee

Figure 1 for Leveraging Large Language Models to Generate Answer Set Programs

Figure 2 for Leveraging Large Language Models to Generate Answer Set Programs

Figure 3 for Leveraging Large Language Models to Generate Answer Set Programs

Figure 4 for Leveraging Large Language Models to Generate Answer Set Programs

Abstract:Large language models (LLMs), such as GPT-3 and GPT-4, have demonstrated exceptional performance in various natural language processing tasks and have shown the ability to solve certain reasoning problems. However, their reasoning capabilities are limited and relatively shallow, despite the application of various prompting techniques. In contrast, formal logic is adept at handling complex reasoning, but translating natural language descriptions into formal logic is a challenging task that non-experts struggle with. This paper proposes a neuro-symbolic method that combines the strengths of large language models and answer set programming. Specifically, we employ an LLM to transform natural language descriptions of logic puzzles into answer set programs. We carefully design prompts for an LLM to convert natural language descriptions into answer set programs in a step by step manner. Surprisingly, with just a few in-context learning examples, LLMs can generate reasonably complex answer set programs. The majority of errors made are relatively simple and can be easily corrected by humans, thus enabling LLMs to effectively assist in the creation of answer set programs.

* 17 pages, KR 2023

Via

Access Paper or Ask Questions

NeurASP: Embracing Neural Networks into Answer Set Programming

Jul 15, 2023

Zhun Yang, Adam Ishay, Joohyung Lee

Figure 1 for NeurASP: Embracing Neural Networks into Answer Set Programming

Figure 2 for NeurASP: Embracing Neural Networks into Answer Set Programming

Figure 3 for NeurASP: Embracing Neural Networks into Answer Set Programming

Figure 4 for NeurASP: Embracing Neural Networks into Answer Set Programming

Abstract:We present NeurASP, a simple extension of answer set programs by embracing neural networks. By treating the neural network output as the probability distribution over atomic facts in answer set programs, NeurASP provides a simple and effective way to integrate sub-symbolic and symbolic computation. We demonstrate how NeurASP can make use of a pre-trained neural network in symbolic computation and how it can improve the neural network's perception result by applying symbolic reasoning in answer set programming. Also, NeurASP can be used to train a neural network better by training with ASP rules so that a neural network not only learns from implicit correlations from the data but also from the explicit complex semantic constraints expressed by the rules.

* 16 pages, 29th International Joint Conference on Artificial Intelligence (IJCAI 2020). arXiv admin note: substantial text overlap with arXiv:2009.10256

Via

Access Paper or Ask Questions

Injecting Logical Constraints into Neural Networks via Straight-Through Estimators

Jul 10, 2023

Zhun Yang, Joohyung Lee, Chiyoun Park

Figure 1 for Injecting Logical Constraints into Neural Networks via Straight-Through Estimators

Figure 2 for Injecting Logical Constraints into Neural Networks via Straight-Through Estimators

Figure 3 for Injecting Logical Constraints into Neural Networks via Straight-Through Estimators

Figure 4 for Injecting Logical Constraints into Neural Networks via Straight-Through Estimators

Abstract:Injecting discrete logical constraints into neural network learning is one of the main challenges in neuro-symbolic AI. We find that a straight-through-estimator, a method introduced to train binary neural networks, could effectively be applied to incorporate logical constraints into neural network learning. More specifically, we design a systematic way to represent discrete logical constraints as a loss function; minimizing this loss using gradient descent via a straight-through-estimator updates the neural network's weights in the direction that the binarized outputs satisfy the logical constraints. The experimental results show that by leveraging GPUs and batch training, this method scales significantly better than existing neuro-symbolic methods that require heavy symbolic computation for computing gradients. Also, we demonstrate that our method applies to different types of neural networks, such as MLP, CNN, and GNN, making them learn with no or fewer labeled data by learning directly from known constraints.

* PMLR 162:25096-25122 (ICML 2022)
* 27 pages

Via

Access Paper or Ask Questions

Learning to Solve Constraint Satisfaction Problems with Recurrent Transformer

Jul 10, 2023

Zhun Yang, Adam Ishay, Joohyung Lee

Figure 1 for Learning to Solve Constraint Satisfaction Problems with Recurrent Transformer

Figure 2 for Learning to Solve Constraint Satisfaction Problems with Recurrent Transformer

Figure 3 for Learning to Solve Constraint Satisfaction Problems with Recurrent Transformer

Figure 4 for Learning to Solve Constraint Satisfaction Problems with Recurrent Transformer

Abstract:Constraint satisfaction problems (CSPs) are about finding values of variables that satisfy the given constraints. We show that Transformer extended with recurrence is a viable approach to learning to solve CSPs in an end-to-end manner, having clear advantages over state-of-the-art methods such as Graph Neural Networks, SATNet, and some neuro-symbolic models. With the ability of Transformer to handle visual input, the proposed Recurrent Transformer can straightforwardly be applied to visual constraint reasoning problems while successfully addressing the symbol grounding problem. We also show how to leverage deductive knowledge of discrete constraints in the Transformer's inductive learning to achieve sample-efficient learning and semi-supervised learning for CSPs.

* 22 pages. The Eleventh International Conference on Learning Representations (ICLR 2023)

Via

Access Paper or Ask Questions

Extending Answer Set Programs with Neural Networks

Sep 22, 2020

Zhun Yang

Figure 1 for Extending Answer Set Programs with Neural Networks

Figure 2 for Extending Answer Set Programs with Neural Networks

Figure 3 for Extending Answer Set Programs with Neural Networks

Abstract:The integration of low-level perception with high-level reasoning is one of the oldest problems in Artificial Intelligence. Recently, several proposals were made to implement the reasoning process in complex neural network architectures. While these works aim at extending neural networks with the capability of reasoning, a natural question that we consider is: can we extend answer set programs with neural networks to allow complex and high-level reasoning on neural network outputs? As a preliminary result, we propose NeurASP -- a simple extension of answer set programs by embracing neural networks where neural network outputs are treated as probability distributions over atomic facts in answer set programs. We show that NeurASP can not only improve the perception accuracy of a pre-trained neural network, but also help to train a neural network better by giving restrictions through logic rules. However, training with NeurASP would take much more time than pure neural network training due to the internal use of a symbolic reasoning engine. For future work, we plan to investigate the potential ways to solve the scalability issue of NeurASP. One potential way is to embed logic programs directly in neural networks. On this route, we plan to first design a SAT solver using neural networks, then extend such a solver to allow logic programs.

* EPTCS 325, 2020, pp. 313-322
* In Proceedings ICLP 2020, arXiv:2009.09158

Via

Access Paper or Ask Questions

Translating LPOD and CR-Prolog2 into Standard Answer Set Programs

May 02, 2018

Joohyung Lee, Zhun Yang

Abstract:Logic Programs with Ordered Disjunction (LPOD) is an extension of standard answer set programs to handle preference using the construct of ordered disjunction, and CR-Prolog2 is an extension of standard answer set programs with consistency restoring rules and LPOD-like ordered disjunction. We present reductions of each of these languages into the standard ASP language, which gives us an alternative way to understand the extensions in terms of the standard ASP language.

* Paper presented at the 34nd International Conference on Logic Programming (ICLP 2018), Oxford, UK, July 14 to July 17, 2018 18 pages, LaTeX, 0 PDF figures (arXiv:YYMM.NNNNN)

Via

Access Paper or Ask Questions