Xingang Guo

DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models

Oct 29, 2024
Via arXiv

Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors

Aug 15, 2024
Via arXiv

Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra

Apr 04, 2024
Via arXiv

Model-Free $μ$-Synthesis: A Nonsmooth Optimization Perspective

Feb 18, 2024
Via arXiv

COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability

Feb 13, 2024
Via arXiv

Exact Formulas for Finite-Time Estimation Errors of Decentralized Temporal Difference Learning with Linear Function Approximation

Apr 20, 2022
Via arXiv

Convex Programs and Lyapunov Functions for Reinforcement Learning: A Unified Perspective on the Analysis of Value-Based Methods

Feb 14, 2022
Via arXiv