Ruisheng Cao

Reducing Tool Hallucination via Reliability Alignment

Dec 05, 2024

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Nov 12, 2024

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Jul 15, 2024

CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions

May 04, 2024

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Apr 11, 2024

A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames

Feb 28, 2024

Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

Feb 28, 2024

ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL

Oct 28, 2023

ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought

Oct 26, 2023

CSS: A Large-scale Cross-schema Chinese Text-to-SQL Medical Dataset

May 25, 2023