Ziru Chen

Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving

Nov 11, 2024

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Oct 07, 2024

Optimizing NOMA Transmissions to Advance Federated Learning in Vehicular Networks

Aug 06, 2024

When is Tree Search Useful for LLM Planning? It Depends on the Discriminator

Feb 16, 2024

eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data

Feb 13, 2024

Roll Up Your Sleeves: Working with a Collaborative and Engaging Task-Oriented Dialogue System

Jul 29, 2023

Error Detection for Text-to-SQL Semantic Parsing

May 23, 2023

Exploring Chain-of-Thought Style Prompting for Text-to-SQL

May 23, 2023

Text-to-SQL Error Correction with Language Models of Code

May 22, 2023

Automatic Evaluation of Attribution by Large Language Models

May 10, 2023