Picture for Alyssa Hwang

Alyssa Hwang

RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors

Add code
May 13, 2024
Figure 1 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 2 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 3 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 4 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Viaarxiv icon

NewsQs: Multi-Source Question Generation for the Inquiring Mind

Add code
Feb 28, 2024
Viaarxiv icon

FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models

Add code
Feb 21, 2024
Viaarxiv icon

Grounded Intuition of GPT-Vision's Abilities with Scientific Images

Add code
Nov 03, 2023
Viaarxiv icon

Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications

Add code
Sep 11, 2023
Figure 1 for Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Figure 2 for Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Figure 3 for Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Figure 4 for Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
Viaarxiv icon

Large Language Models as Sous Chefs: Revising Recipes with GPT-3

Add code
Jun 24, 2023
Figure 1 for Large Language Models as Sous Chefs: Revising Recipes with GPT-3
Figure 2 for Large Language Models as Sous Chefs: Revising Recipes with GPT-3
Figure 3 for Large Language Models as Sous Chefs: Revising Recipes with GPT-3
Viaarxiv icon

AMPERSAND: Argument Mining for PERSuAsive oNline Discussions

Add code
Apr 30, 2020
Figure 1 for AMPERSAND: Argument Mining for PERSuAsive oNline Discussions
Figure 2 for AMPERSAND: Argument Mining for PERSuAsive oNline Discussions
Figure 3 for AMPERSAND: Argument Mining for PERSuAsive oNline Discussions
Figure 4 for AMPERSAND: Argument Mining for PERSuAsive oNline Discussions
Viaarxiv icon