Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yitian Chen

Solver-Informed RL: Grounding Large Language Models for Authentic Optimization Modeling

May 17, 2025

Yitian Chen, Jingfan Xia, Siyu Shao, Dongdong Ge, Yinyu Ye

Abstract:Optimization modeling is fundamental to decision-making across diverse domains.Despite progress in automating optimization formulation from natural language descriptions, Large Language Models (LLMs) often struggle to generate formally correct and usable models due to hallucinations, posing a challenge for reliable automation. Inspired by the success of Reinforcement Learning (RL) in enhancing Large Reasoning Models, we present Solver-Informed Reinforcement Learning (SIRL).This novel framework leverages external optimization solvers as verifiable reward mechanisms to significantly improve the authenticity of LLMs for optimization modeling.Acting as precise verifiers, these solvers automatically assess the executable code and the instance-level mathematical model represented by the associated LP file, yielding precise and comprehensive feedback signals -- including syntax, feasibility, and solution quality that directly inform the RL process. This automated verification process, powered by classic optimization solvers, also underpins our instance-enhanced self-consistency method to synthesize high-quality training data. Extensive experiments on diverse public benchmarks demonstrate that SIRL achieves state-of-the-art performance, substantially outperforming existing methods in generating accurate and executable optimization models.

Via

Access Paper or Ask Questions

LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach

Sep 17, 2024

Kunlong Chen, Junjun Wang, Zhaoqun Chen, Kunjin Chen, Yitian Chen

Abstract:We participated in the KDD CUP 2024 paper source tracing competition and achieved the 3rd place. This competition tasked participants with identifying the reference sources (i.e., ref-sources, as referred to by the organizers of the competition) of given academic papers. Unlike most teams that addressed this challenge by fine-tuning pre-trained neural language models such as BERT or ChatGLM, our primary approach utilized closed-source large language models (LLMs). With recent advancements in LLM technology, closed-source LLMs have demonstrated the capability to tackle complex reasoning tasks in zero-shot or few-shot scenarios. Consequently, in the absence of GPUs, we employed closed-source LLMs to directly generate predicted reference sources from the provided papers. We further refined these predictions through ensemble learning. Notably, our method was the only one among the award-winning approaches that did not require the use of GPUs for model training. Code available at https://github.com/Cklwanfifa/KDDCUP2024-PST.

Via

Access Paper or Ask Questions

ScrollTimes: Tracing the Provenance of Paintings as a Window into History

Jun 15, 2023

Wei Zhang, Jason K. Wong, Yitian Chen, Ailing Jia, Luwei Wang, Jian-Wei Zhang, Lechao Cheng, Wei Chen

Figure 1 for ScrollTimes: Tracing the Provenance of Paintings as a Window into History

Figure 2 for ScrollTimes: Tracing the Provenance of Paintings as a Window into History

Figure 3 for ScrollTimes: Tracing the Provenance of Paintings as a Window into History

Figure 4 for ScrollTimes: Tracing the Provenance of Paintings as a Window into History

Abstract:Digital humanities research has flourished due to the diverse artifacts available in cultural heritage databases. However, over-reliance on a single artifact type can result in poor contextualization and a constrained understanding of historical context. We collaborated with art historians to examine handscrolls, a form of traditional Chinese painting which offers a wealth of data for historical analysis and provides a unique opportunity for understanding history through artwork. We propose ScrollTimes, a visual analysis system for tracing handscroll historic context by linking multiple data sources. Specifically, a unique layout is developed for efficiently viewing long handscrolls. Using image processing techniques and language models, we extract, verify, and supplement elements in handscrolls with different cultural heritage databases. Furthermore, interactive biographies are constructed for handscrolls to uncover their historical narratives, provenance trajectories, and artistic legacies. Validated through case studies and expert interviews, our approach offers a window into history, fostering a holistic understanding of handscroll provenance and historical significance.

* Tech Report, 11 pages, 7 figures

Via

Access Paper or Ask Questions

Regret Analysis of Online LQR Control via Trajectory Prediction and Tracking: Extended Version

Feb 21, 2023

Yitian Chen, Timothy L. Molloy, Tyler Summers, Iman Shames

Abstract:In this paper, we propose and analyze a new method for online linear quadratic regulator (LQR) control with a priori unknown time-varying cost matrices. The cost matrices are revealed sequentially with the potential for future values to be previewed over a short window. Our novel method involves using the available cost matrices to predict the optimal trajectory, and a tracking controller to drive the system towards it. We adopted the notion of dynamic regret to measure the performance of this proposed online LQR control method, with our main result being that the (dynamic) regret of our method is upper bounded by a constant. Moreover, the regret upper bound decays exponentially with the preview window length, and is extendable to systems with disturbances. We show in simulations that our proposed method offers improved performance compared to other previously proposed online LQR methods.

* Submitted to L4DC2023

Via

Access Paper or Ask Questions

GP-NAS-ensemble: a model for NAS Performance Prediction

Jan 23, 2023

Kunlong Chen, Liu Yang, Yitian Chen, Kunjin Chen, Yidan Xu, Lujun Li

Abstract:It is of great significance to estimate the performance of a given model architecture without training in the application of Neural Architecture Search (NAS) as it may take a lot of time to evaluate the performance of an architecture. In this paper, a novel NAS framework called GP-NAS-ensemble is proposed to predict the performance of a neural network architecture with a small training dataset. We make several improvements on the GP-NAS model to make it share the advantage of ensemble learning methods. Our method ranks second in the CVPR2022 second lightweight NAS challenge performance prediction track.

Via

Access Paper or Ask Questions

DQN Control Solution for KDD Cup 2021 City Brain Challenge

Aug 14, 2021

Yitian Chen, Kunlong Chen, Kunjin Chen, Lin Wang

Figure 1 for DQN Control Solution for KDD Cup 2021 City Brain Challenge

Figure 2 for DQN Control Solution for KDD Cup 2021 City Brain Challenge

Figure 3 for DQN Control Solution for KDD Cup 2021 City Brain Challenge

Figure 4 for DQN Control Solution for KDD Cup 2021 City Brain Challenge

Abstract:We took part in the city brain challenge competition and achieved the 8th place. In this competition, the players are provided with a real-world city-scale road network and its traffic demand derived from real traffic data. The players are asked to coordinate the traffic signals with a self-designed agent to maximize the number of vehicles served while maintaining an acceptable delay. In this abstract paper, we present an overall analysis and our detailed solution to this competition. Our approach is mainly based on the adaptation of the deep Q-network (DQN) for real-time traffic signal control. From our perspective, the major challenge of this competition is how to extend the classical DQN framework to traffic signals control in real-world complex road network and traffic flow situation. After trying and implementing several classical reward functions, we finally chose to apply our newly-designed reward in our agent. By applying our newly-proposed reward function and carefully tuning the control scheme, an agent based on a single DQN model can rank among the top 15 teams. We hope this paper could serve, to some extent, as a baseline solution to traffic signal control of real-world road network and inspire further attempts and researches.

* 5 pages, report for KDD Cup 2021 City Brain Challenge workshop

Via

Access Paper or Ask Questions

Probabilistic Forecasting with Temporal Convolutional Neural Network

Jun 11, 2019

Yitian Chen, Yanfei Kang, Yixiong Chen, Zizhuo Wang

Figure 1 for Probabilistic Forecasting with Temporal Convolutional Neural Network

Figure 2 for Probabilistic Forecasting with Temporal Convolutional Neural Network

Figure 3 for Probabilistic Forecasting with Temporal Convolutional Neural Network

Figure 4 for Probabilistic Forecasting with Temporal Convolutional Neural Network

Abstract:We present a probabilistic forecasting framework based on convolutional neural network for multiple related time series forecasting. The framework can be applied to estimate probability density under both parametric and non-parametric settings. More specifically, stacked residual blocks based on dilated causal convolutional nets are constructed to capture the temporal dependencies of the series. Combined with representation learning, our approach is able to learn complex patterns such as seasonality, holiday effects within and across series, and to leverage those patterns for more accurate forecasts, especially when historical data is sparse or unavailable. Extensive empirical studies are performed on several real-world datasets, including datasets from JD.com, China's largest online retailer. The results show that our framework outperforms other state-of-the-art methods in both accuracy and efficiency.

* 24 pages, 3 figures, 8 tables

Via

Access Paper or Ask Questions