Ruoxi Sun

Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies

Feb 04, 2025
Via arXiv

SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling

Jan 31, 2025
Via arXiv

Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Jan 18, 2025
Via arXiv

Data-Centric Improvements for Enhancing Multi-Modal Understanding in Spoken Conversation Modeling

Dec 20, 2024
Via arXiv

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Nov 12, 2024
Via arXiv

AI-Compass: A Comprehensive and Effective Multi-module Testing Tool for AI Systems

Nov 09, 2024
Via arXiv

Edge Unlearning is Not "on Edge"! An Adaptive Exact Unlearning System on Resource-Constrained Devices

Oct 15, 2024
Via arXiv

Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models

Oct 09, 2024
Via arXiv

CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL

Oct 02, 2024
Via arXiv

SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Merging

Aug 22, 2024
Via arXiv