Kaheer Suleman

Investigating Failures to Generalize for Coreference Resolution Models

Mar 16, 2023

The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources in Natural Language Understanding Systems

Dec 15, 2022

Deconstructing NLG Evaluation: Evaluation Practices, Assumptions, and Their Implications

May 13, 2022

TopiOCQA: Open-domain Conversational Question Answering with Topic Switching

Oct 02, 2021

Modeling Event Plausibility with Consistent Conceptual Abstraction

Apr 20, 2021

An Analysis of Dataset Overlap on Winograd-Style Tasks

Nov 09, 2020

Can a Gorilla Ride a Camel? Learning Semantic Plausibility from Text

Nov 13, 2019

Improving Neural Question Generation using World Knowledge

Sep 10, 2019

Playing log(N)-Questions over Sentences

Aug 13, 2019

On the Evaluation of Common-Sense Reasoning in Natural Language Understanding

Nov 05, 2018