Picture for Michael Cogswell

Michael Cogswell

BloomVQA: Assessing Hierarchical Multi-modal Comprehension

Add code
Dec 20, 2023
Viaarxiv icon

A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval

Add code
Nov 30, 2023
Figure 1 for A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval
Figure 2 for A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval
Figure 3 for A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval
Figure 4 for A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval
Viaarxiv icon

DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback

Add code
Nov 16, 2023
Figure 1 for DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
Figure 2 for DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
Figure 3 for DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
Figure 4 for DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
Viaarxiv icon

Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models

Add code
Sep 08, 2023
Viaarxiv icon

Probing Conceptual Understanding of Large Visual-Language Models

Add code
Apr 07, 2023
Figure 1 for Probing Conceptual Understanding of Large Visual-Language Models
Figure 2 for Probing Conceptual Understanding of Large Visual-Language Models
Figure 3 for Probing Conceptual Understanding of Large Visual-Language Models
Figure 4 for Probing Conceptual Understanding of Large Visual-Language Models
Viaarxiv icon

Unpacking Large Language Models with Conceptual Consistency

Add code
Sep 29, 2022
Figure 1 for Unpacking Large Language Models with Conceptual Consistency
Figure 2 for Unpacking Large Language Models with Conceptual Consistency
Figure 3 for Unpacking Large Language Models with Conceptual Consistency
Figure 4 for Unpacking Large Language Models with Conceptual Consistency
Viaarxiv icon

Improving Users' Mental Model with Attention-directed Counterfactual Edits

Add code
Oct 15, 2021
Figure 1 for Improving Users' Mental Model with Attention-directed Counterfactual Edits
Figure 2 for Improving Users' Mental Model with Attention-directed Counterfactual Edits
Figure 3 for Improving Users' Mental Model with Attention-directed Counterfactual Edits
Figure 4 for Improving Users' Mental Model with Attention-directed Counterfactual Edits
Viaarxiv icon

Trigger Hunting with a Topological Prior for Trojan Detection

Add code
Oct 15, 2021
Figure 1 for Trigger Hunting with a Topological Prior for Trojan Detection
Figure 2 for Trigger Hunting with a Topological Prior for Trojan Detection
Figure 3 for Trigger Hunting with a Topological Prior for Trojan Detection
Figure 4 for Trigger Hunting with a Topological Prior for Trojan Detection
Viaarxiv icon

Comprehension Based Question Answering using Bloom's Taxonomy

Add code
Jun 08, 2021
Figure 1 for Comprehension Based Question Answering using Bloom's Taxonomy
Figure 2 for Comprehension Based Question Answering using Bloom's Taxonomy
Figure 3 for Comprehension Based Question Answering using Bloom's Taxonomy
Figure 4 for Comprehension Based Question Answering using Bloom's Taxonomy
Viaarxiv icon

Knowing What VQA Does Not: Pointing to Error-Inducing Regions to Improve Explanation Helpfulness

Add code
Mar 31, 2021
Figure 1 for Knowing What VQA Does Not: Pointing to Error-Inducing Regions to Improve Explanation Helpfulness
Figure 2 for Knowing What VQA Does Not: Pointing to Error-Inducing Regions to Improve Explanation Helpfulness
Figure 3 for Knowing What VQA Does Not: Pointing to Error-Inducing Regions to Improve Explanation Helpfulness
Figure 4 for Knowing What VQA Does Not: Pointing to Error-Inducing Regions to Improve Explanation Helpfulness
Viaarxiv icon