Picture for Mayank Kejriwal

Mayank Kejriwal

SelECT-SQL: Self-correcting ensemble Chain-of-Thought for Text-to-SQL

Add code
Sep 16, 2024
Viaarxiv icon

Defining and Evaluating Decision and Composite Risk in Language Models Applied to Natural Language Inference

Add code
Aug 04, 2024
Viaarxiv icon

GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning

Add code
Jul 02, 2024
Viaarxiv icon

Is persona enough for personality? Using ChatGPT to reconstruct an agent's latent personality from simple descriptions

Add code
Jun 18, 2024
Viaarxiv icon

An Evaluation of Estimative Uncertainty in Large Language Models

Add code
May 24, 2024
Viaarxiv icon

Understanding and Estimating Domain Complexity Across Domains

Add code
Dec 20, 2023
Viaarxiv icon

HALO: An Ontology for Representing Hallucinations in Generative Models

Add code
Dec 08, 2023
Viaarxiv icon

How does prompt engineering affect ChatGPT performance on unsupervised entity resolution?

Add code
Oct 09, 2023
Viaarxiv icon

A Knowledge Graph-Based Search Engine for Robustly Finding Doctors and Locations in the Healthcare Domain

Add code
Oct 08, 2023
Viaarxiv icon

A Formalism and Approach for Improving Robustness of Large Language Models Using Risk-Adjusted Confidence Scores

Add code
Oct 05, 2023
Viaarxiv icon