Zhoujun Cheng

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Jun 26, 2024
Via arXiv

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Apr 11, 2024
Via arXiv

What Are Tools Anyway? A Survey from the Language Model Perspective

Mar 18, 2024
Via arXiv

OpenAgents: An Open Platform for Language Agents in the Wild

Oct 16, 2023
Via arXiv

Lemur: Harmonizing Natural Language and Code for Language Agents

Oct 10, 2023
Via arXiv

Batch Prompting: Efficient Inference with Large Language Model APIs

Jan 19, 2023
Via arXiv

Reflection of Thought: Inversely Eliciting Numerical Reasoning in Language Models via Solving Linear Systems

Oct 11, 2022
Via arXiv

Binding Language Models in Symbolic Languages

Oct 06, 2022
Via arXiv

TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data

May 25, 2022
Via arXiv

Table Pre-training: A Survey on Model Architectures, Pretraining Objectives, and Downstream Tasks

Jan 27, 2022
Via arXiv