Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Liqian Peng

LLM Cascade with Multi-Objective Optimal Consideration

Oct 10, 2024

Kai Zhang, Liqian Peng, Congchao Wang, Alec Go, Xiaozhong Liu

Figure 1 for LLM Cascade with Multi-Objective Optimal Consideration

Figure 2 for LLM Cascade with Multi-Objective Optimal Consideration

Figure 3 for LLM Cascade with Multi-Objective Optimal Consideration

Figure 4 for LLM Cascade with Multi-Objective Optimal Consideration

Abstract:Large Language Models (LLMs) have demonstrated exceptional capabilities in understanding and generating natural language. However, their high deployment costs often pose a barrier to practical applications, especially. Cascading local and server models offers a promising solution to this challenge. While existing studies on LLM cascades have primarily focused on the performance-cost trade-off, real-world scenarios often involve more complex requirements. This paper introduces a novel LLM Cascade strategy with Multi-Objective Optimization, enabling LLM cascades to consider additional objectives (e.g., privacy) and better align with the specific demands of real-world applications while maintaining their original cascading abilities. Extensive experiments on three benchmarks validate the effectiveness and superiority of our approach.

Via

Access Paper or Ask Questions

Non-intrusive Nonlinear Model Reduction via Machine Learning Approximations to Low-dimensional Operators

Jun 17, 2021

Zhe Bai, Liqian Peng

Figure 1 for Non-intrusive Nonlinear Model Reduction via Machine Learning Approximations to Low-dimensional Operators

Figure 2 for Non-intrusive Nonlinear Model Reduction via Machine Learning Approximations to Low-dimensional Operators

Figure 3 for Non-intrusive Nonlinear Model Reduction via Machine Learning Approximations to Low-dimensional Operators

Figure 4 for Non-intrusive Nonlinear Model Reduction via Machine Learning Approximations to Low-dimensional Operators

Abstract:Although projection-based reduced-order models (ROMs) for parameterized nonlinear dynamical systems have demonstrated exciting results across a range of applications, their broad adoption has been limited by their intrusivity: implementing such a reduced-order model typically requires significant modifications to the underlying simulation code. To address this, we propose a method that enables traditionally intrusive reduced-order models to be accurately approximated in a non-intrusive manner. Specifically, the approach approximates the low-dimensional operators associated with projection-based reduced-order models (ROMs) using modern machine-learning regression techniques. The only requirement of the simulation code is the ability to export the velocity given the state and parameters as this functionality is used to train the approximated low-dimensional operators. In addition to enabling nonintrusivity, we demonstrate that the approach also leads to very low computational complexity, achieving up to $1000\times$ reduction in run time. We demonstrate the effectiveness of the proposed technique on two types of PDEs.

Via

Access Paper or Ask Questions