Ruoxi Sun

Keep the Lights On, Keep the Lengths in Check: Plug-In Adversarial Detection for Time-Series LLMs in Energy Forecasting

Dec 13, 2025
Via arXiv

VISTAR: A User-Centric and Role-Driven Benchmark for Text-to-Image Evaluation

Aug 08, 2025
Via arXiv

What's Pulling the Strings? Evaluating Integrity and Attribution in AI Training and Inference through Concept Shift

Apr 28, 2025
Via arXiv

Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection

Apr 02, 2025
Via arXiv

Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL

Apr 01, 2025
Via arXiv

Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies

Feb 04, 2025
Via arXiv

SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling

Jan 31, 2025
Via arXiv

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Jan 18, 2025
Via arXiv

Data-Centric Improvements for Enhancing Multi-Modal Understanding in Spoken Conversation Modeling

Dec 20, 2024
Via arXiv

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Nov 12, 2024
Via arXiv