Shengyu Feng

CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization

Apr 06, 2025

Weiwei Sun, Shengyu Feng, Shanda Li, Yiming Yang

Abstract:Although LLM-based agents have attracted significant attention in domains such as software engineering and machine learning research, their role in advancing combinatorial optimization (CO) remains relatively underexplored. This gap underscores the need for a deeper understanding of their potential in tackling structured, constraint-intensive problems, a pursuit currently limited by the absence of comprehensive benchmarks for systematic investigation. To address this, we introduce CO-Bench, a benchmark suite featuring 36 real-world CO problems drawn from a broad range of domains and complexity levels. CO-Bench includes structured problem formulations and curated data to support rigorous investigation of LLM agents. We evaluate multiple agent frameworks against established human-designed algorithms, revealing key strengths and limitations of current approaches and identifying promising directions for future research. CO-Bench is publicly available at https://github.com/sunnweiwei/CO-Bench.

SORREL: Suboptimal-Demonstration-Guided Reinforcement Learning for Learning to Branch

Dec 20, 2024

Shengyu Feng, Yiming Yang

Abstract:Mixed Integer Linear Program (MILP) solvers are mostly built upon a Branch-and-Bound (B&B) algorithm, where the efficiency of traditional solvers heavily depends on hand-crafted heuristics for branching. The past few years have witnessed the increasing popularity of data-driven approaches to automatically learn these heuristics. However, the success of these methods is highly dependent on the availability of high-quality demonstrations, which requires either the development of near-optimal heuristics or a time-consuming sampling process. This paper averts this challenge by proposing Suboptimal-Demonstration-Guided Reinforcement Learning (SORREL) for learning to branch. SORREL selectively learns from suboptimal demonstrations based on value estimation. It utilizes suboptimal demonstrations through both offline reinforcement learning on the demonstrations generated by suboptimal heuristics and self-imitation learning on past good experiences sampled by itself. Our experiments demonstrate its advanced performance in both branching quality and training efficiency over previous methods for various MILPs.

* AAAI 2025
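
To make the role of a branching heuristic concrete, here is a minimal sketch of Branch-and-Bound on a 0/1 knapsack problem rather than a general MILP. It is not SORREL and not how a production solver works; the function names and the `choose` parameter are hypothetical, and item weights are assumed to be positive. The `choose` callback is the slot that a hand-crafted or learned branching rule would fill.

```python
def fractional_bound(values, weights, capacity, fixed):
    """Upper bound from the LP relaxation, given some variables fixed to 0 or 1.

    Returns None when the fixed items already exceed the capacity.
    """
    total, room = 0.0, capacity
    for i, x in fixed.items():
        if x == 1:
            total += values[i]
            room -= weights[i]
    if room < 0:
        return None
    free = [i for i in range(len(values)) if i not in fixed]
    # Greedily fill the remaining room by value density; the last item may be fractional.
    free.sort(key=lambda i: values[i] / weights[i], reverse=True)
    for i in free:
        if weights[i] <= room:
            total += values[i]
            room -= weights[i]
        else:
            total += values[i] * room / weights[i]
            break
    return total


def branch_and_bound(values, weights, capacity, choose):
    """Depth-first B&B; `choose` picks which free variable to branch on."""
    best, nodes = 0.0, 0
    stack = [{}]  # each node is a dict {item index: 0 or 1}
    while stack:
        fixed = stack.pop()
        nodes += 1
        bound = fractional_bound(values, weights, capacity, fixed)
        if bound is None or bound <= best:
            continue  # infeasible, or cannot beat the incumbent
        free = [i for i in range(len(values)) if i not in fixed]
        if not free:
            best = bound  # every variable is fixed, so the bound is the exact value
            continue
        i = choose(free, values, weights)
        stack.append({**fixed, i: 0})
        stack.append({**fixed, i: 1})
    return best, nodes


def first_free(free, values, weights):
    """A deliberately naive branching rule: always take the lowest index."""
    return free[0]


best, nodes = branch_and_bound([60, 100, 120], [10, 20, 30], 50, first_free)
```

Swapping `first_free` for a different rule changes how many nodes are explored but not the optimum, which is why branching quality is usually measured by tree size or solving time.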

Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo

Oct 02, 2024

Shengyu Feng, Xiang Kong, Shuang Ma, Aonan Zhang, Dong Yin, Chong Wang, Ruoming Pang, Yiming Yang

Abstract:Augmenting the multi-step reasoning abilities of Large Language Models (LLMs) has been a persistent challenge. Recently, verification has shown promise in improving solution consistency by evaluating generated outputs. However, current verification approaches suffer from sampling inefficiencies, requiring a large number of samples to achieve satisfactory performance. Additionally, training an effective verifier often depends on extensive process supervision, which is costly to acquire. In this paper, we address these limitations by introducing a novel verification method based on Twisted Sequential Monte Carlo (TSMC). TSMC sequentially refines its sampling effort to focus exploration on promising candidates, resulting in more efficient generation of high-quality solutions. We apply TSMC to LLMs by estimating the expected future rewards at partial solutions. This approach results in a more straightforward training target that eliminates the need for step-wise human annotations. We empirically demonstrate the advantages of our method across multiple math benchmarks, and also validate our theoretical analysis of both our approach and existing verification methods.
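
The idea of steering sampling toward promising partial solutions can be illustrated with a small sequential Monte Carlo loop. This is a toy sketch under stated assumptions, not the paper's TSMC implementation: `extend` stands in for an LLM proposing the next step, `value` stands in for a learned estimate of expected future reward and is assumed to be strictly positive, and all names are hypothetical. After each extension, particles are resampled in proportion to the ratio of the new value estimate to the old one.

```python
import random


def smc_generate(extend, value, n_particles, n_steps, seed=0):
    """Grow partial solutions step by step, resampling by value-estimate ratios."""
    rng = random.Random(seed)
    particles = [[] for _ in range(n_particles)]
    for _ in range(n_steps):
        extended = [extend(p, rng) for p in particles]
        # Incremental weight: how much the value estimate changed with this step.
        weights = [value(new) / value(old) for old, new in zip(particles, extended)]
        # Resampling duplicates promising partial solutions and drops weak ones.
        particles = rng.choices(extended, weights=weights, k=n_particles)
    return particles


def toy_extend(partial, rng):
    """Toy proposal: append a random bit (returns a new list, never mutates)."""
    return partial + [rng.randint(0, 1)]


def toy_value(partial):
    """Toy value estimate: every 0 in the sequence is treated as a likely mistake."""
    return 0.05 ** partial.count(0)


particles = smc_generate(toy_extend, toy_value, n_particles=500, n_steps=5)
all_ones = sum(p == [1] * 5 for p in particles) / len(particles)
```

In this toy setting, independent sampling would produce the all-ones sequence about 3% of the time, whereas the resampled population concentrates most of its mass on it.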

Concept Discovery for Fast Adaptation

Jan 19, 2023

Shengyu Feng, Hanghang Tong

Abstract:The advances in deep learning have enabled machine learning methods to outperform human beings in various areas, but it remains a great challenge for a well-trained model to quickly adapt to a new task. One promising solution to realize this goal is through meta-learning, also known as learning to learn, which has achieved promising results in few-shot learning. However, current approaches are still enormously different from human beings' learning process, especially in the ability to extract structural and transferable knowledge. This drawback makes current meta-learning frameworks non-interpretable and hard to extend to more complex tasks. We tackle this problem by introducing concept discovery to the few-shot learning problem, where we achieve more effective adaptation by meta-learning the structure among the data features, leading to a composite representation of the data. Our proposed method Concept-Based Model-Agnostic Meta-Learning (COMAML) has been shown to achieve consistent improvements in the structured data for both synthesized datasets and real-world datasets.

* SDM 2023

ARIEL: Adversarial Graph Contrastive Learning

Aug 15, 2022

Shengyu Feng, Baoyu Jing, Yada Zhu, Hanghang Tong

Abstract:Contrastive learning is an effective unsupervised method in graph representation learning, and the key component of contrastive learning lies in the construction of positive and negative samples. Previous methods usually utilize the proximity of nodes in the graph as the principle. Recently, the data-augmentation-based contrastive learning method has advanced to show great power in the visual domain, and some works extended this method from images to graphs. However, unlike the data augmentation on images, the data augmentation on graphs is far less intuitive and much harder to provide high-quality contrastive samples, which leaves much space for improvement. In this work, by introducing an adversarial graph view for data augmentation, we propose a simple but effective method, Adversarial Graph Contrastive Learning (ARIEL), to extract informative contrastive samples within reasonable constraints. We develop a new technique called information regularization for stable training and use subgraph sampling for scalability. We generalize our method from node-level contrastive learning to the graph-level by treating each graph instance as a supernode. ARIEL consistently outperforms the current graph contrastive learning methods for both node-level and graph-level classification tasks on real-world datasets. We further demonstrate that ARIEL is more robust in the face of adversarial attacks.

Adversarial Graph Contrastive Learning with Information Regularization

Mar 03, 2022

Shengyu Feng, Baoyu Jing, Yada Zhu, Hanghang Tong

Abstract:Contrastive learning is an effective unsupervised method in graph representation learning. Recently, the data augmentation based contrastive learning method has been extended from images to graphs. However, most prior works are directly adapted from the models designed for images. Unlike the data augmentation on images, the data augmentation on graphs is far less intuitive and much harder to provide high-quality contrastive samples, which are the key to the performance of contrastive learning models. This leaves much space for improvement over the existing graph contrastive learning frameworks. In this work, by introducing an adversarial graph view and an information regularizer, we propose a simple but effective method, Adversarial Graph Contrastive Learning (ARIEL), to extract informative contrastive samples within a reasonable constraint. It consistently outperforms the current graph contrastive learning methods in the node classification task over various real-world datasets and further improves the robustness of graph contrastive learning.

* WWW 2022
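
Why the quality of contrastive samples matters is easiest to see from the loss itself. The sketch below is a generic InfoNCE-style objective for a single anchor embedding, written with plain Python lists; it is an assumption for illustration only and does not include ARIEL's adversarial view or information regularizer. The function names and the temperature value are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(anchor, positive, negatives, temperature=0.5):
    """Loss is low when the anchor is close to its positive and far from negatives."""
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = sum(math.exp(cosine(anchor, n) / temperature) for n in negatives)
    return -math.log(pos / (pos + neg))


# A well-chosen pair: the positive matches the anchor, the negative is orthogonal.
good = info_nce([1.0, 0.0], [1.0, 0.0], [[0.0, 1.0]])
# A poorly chosen pair: the "positive" is orthogonal and the negative matches.
bad = info_nce([1.0, 0.0], [0.0, 1.0], [[1.0, 0.0]])
```

With these toy vectors `good` is much smaller than `bad`, so the gradient signal depends directly on which views are treated as positives and negatives.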

Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs

Dec 18, 2021

Shengyu Feng, Subarna Tripathi, Hesham Mostafa, Marcel Nassar, Somdeb Majumdar

Abstract:Structured video representation in the form of dynamic scene graphs is an effective tool for several video understanding tasks. Compared to the task of scene graph generation from images, dynamic scene graph generation is more challenging due to the temporal dynamics of the scene and the inherent temporal fluctuations of predictions. We show that capturing long-term dependencies is the key to effective generation of dynamic scene graphs. We present the detect-track-recognize paradigm by constructing consistent long-term object tracklets from a video, followed by transformers to capture the dynamics of objects and visual relations. Experimental results demonstrate that our Dynamic Scene Graph Detection Transformer (DSG-DETR) outperforms state-of-the-art methods by a significant margin on the benchmark dataset Action Genome. We also perform ablation studies and validate the effectiveness of each component of the proposed approach.
