Picture for Andrew Zhu

Andrew Zhu

GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge

Add code
Jan 15, 2025
Viaarxiv icon

You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling

Add code
Sep 11, 2024
Figure 1 for You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling
Figure 2 for You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling
Figure 3 for You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling
Figure 4 for You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling
Viaarxiv icon

ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems

Add code
Aug 05, 2024
Figure 1 for ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems
Figure 2 for ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems
Figure 3 for ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems
Figure 4 for ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems
Viaarxiv icon

RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors

Add code
May 13, 2024
Figure 1 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 2 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 3 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 4 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Viaarxiv icon

FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models

Add code
Feb 21, 2024
Viaarxiv icon

Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications

Add code
Sep 11, 2023
Figure 1 for Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Figure 2 for Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Figure 3 for Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Figure 4 for Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Viaarxiv icon

CALYPSO: LLMs as Dungeon Masters' Assistants

Add code
Aug 15, 2023
Figure 1 for CALYPSO: LLMs as Dungeon Masters' Assistants
Figure 2 for CALYPSO: LLMs as Dungeon Masters' Assistants
Figure 3 for CALYPSO: LLMs as Dungeon Masters' Assistants
Figure 4 for CALYPSO: LLMs as Dungeon Masters' Assistants
Viaarxiv icon

FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information

Add code
May 08, 2023
Viaarxiv icon

An AI Dungeon Master's Guide: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons

Add code
Dec 20, 2022
Viaarxiv icon