Picture for Yuting Ning

Yuting Ning

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Add code
Oct 07, 2024
Figure 1 for ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Figure 2 for ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Figure 3 for ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Figure 4 for ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Viaarxiv icon

Pandora: Towards General World Model with Natural Language Actions and Video States

Add code
Jun 12, 2024
Viaarxiv icon

EduNLP: Towards a Unified and Modularized Library for Educational Resources

Add code
Jun 04, 2024
Viaarxiv icon

In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search

Add code
Nov 13, 2023
Figure 1 for In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search
Figure 2 for In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search
Figure 3 for In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search
Figure 4 for In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search
Viaarxiv icon

Efficiently Measuring the Cognitive Ability of LLMs: An Adaptive Testing Perspective

Add code
Jun 18, 2023
Viaarxiv icon

A Novel Approach for Auto-Formulation of Optimization Problems

Add code
Feb 09, 2023
Viaarxiv icon

Towards a Holistic Understanding of Mathematical Questions with Contrastive Pre-training

Add code
Jan 18, 2023
Viaarxiv icon