Picture for Bill Howe

Bill Howe

Know Your Limits: A Survey of Abstention in Large Language Models

Add code
Aug 08, 2024
Figure 1 for Know Your Limits: A Survey of Abstention in Large Language Models
Figure 2 for Know Your Limits: A Survey of Abstention in Large Language Models
Figure 3 for Know Your Limits: A Survey of Abstention in Large Language Models
Figure 4 for Know Your Limits: A Survey of Abstention in Large Language Models
Viaarxiv icon

ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social Science

Add code
Aug 04, 2024
Viaarxiv icon

Representation Bias of Adolescents in AI: A Bilingual, Bicultural Study

Add code
Aug 04, 2024
Viaarxiv icon

Dataset Scale and Societal Consistency Mediate Facial Impression Bias in Vision-Language AI

Add code
Aug 04, 2024
Viaarxiv icon

Towards Zero-Shot Annotation of the Built Environment with Vision-Language Models (Vision Paper)

Add code
Aug 01, 2024
Viaarxiv icon

The Art of Refusal: A Survey of Abstention in Large Language Models

Add code
Jul 25, 2024
Figure 1 for The Art of Refusal: A Survey of Abstention in Large Language Models
Figure 2 for The Art of Refusal: A Survey of Abstention in Large Language Models
Figure 3 for The Art of Refusal: A Survey of Abstention in Large Language Models
Figure 4 for The Art of Refusal: A Survey of Abstention in Large Language Models
Viaarxiv icon

PathwayBench: Assessing Routability of Pedestrian Pathway Networks Inferred from Multi-City Imagery

Add code
Jul 23, 2024
Figure 1 for PathwayBench: Assessing Routability of Pedestrian Pathway Networks Inferred from Multi-City Imagery
Figure 2 for PathwayBench: Assessing Routability of Pedestrian Pathway Networks Inferred from Multi-City Imagery
Figure 3 for PathwayBench: Assessing Routability of Pedestrian Pathway Networks Inferred from Multi-City Imagery
Figure 4 for PathwayBench: Assessing Routability of Pedestrian Pathway Networks Inferred from Multi-City Imagery
Viaarxiv icon

Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings

Add code
May 27, 2024
Viaarxiv icon

Characterizing LLM Abstention Behavior in Science QA with Context Perturbations

Add code
Apr 18, 2024
Viaarxiv icon

InfoVisDial: An Informative Visual Dialogue Dataset by Bridging Large Multimodal and Language Models

Add code
Dec 21, 2023
Viaarxiv icon