Picture for Douwe Kiela

Douwe Kiela

Nearest Neighbor Normalization Improves Multimodal Retrieval

Add code
Oct 31, 2024
Figure 1 for Nearest Neighbor Normalization Improves Multimodal Retrieval
Figure 2 for Nearest Neighbor Normalization Improves Multimodal Retrieval
Figure 3 for Nearest Neighbor Normalization Improves Multimodal Retrieval
Figure 4 for Nearest Neighbor Normalization Improves Multimodal Retrieval
Viaarxiv icon

OLMoE: Open Mixture-of-Experts Language Models

Add code
Sep 03, 2024
Figure 1 for OLMoE: Open Mixture-of-Experts Language Models
Figure 2 for OLMoE: Open Mixture-of-Experts Language Models
Figure 3 for OLMoE: Open Mixture-of-Experts Language Models
Figure 4 for OLMoE: Open Mixture-of-Experts Language Models
Viaarxiv icon

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

Add code
Aug 12, 2024
Figure 1 for Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
Figure 2 for Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
Figure 3 for Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
Figure 4 for Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
Viaarxiv icon

Lynx: An Open Source Hallucination Evaluation Model

Add code
Jul 11, 2024
Figure 1 for Lynx: An Open Source Hallucination Evaluation Model
Figure 2 for Lynx: An Open Source Hallucination Evaluation Model
Figure 3 for Lynx: An Open Source Hallucination Evaluation Model
Figure 4 for Lynx: An Open Source Hallucination Evaluation Model
Viaarxiv icon

Generative Representational Instruction Tuning

Add code
Feb 15, 2024
Viaarxiv icon

KTO: Model Alignment as Prospect Theoretic Optimization

Add code
Feb 02, 2024
Viaarxiv icon

I am a Strange Dataset: Metalinguistic Tests for Language Models

Add code
Jan 10, 2024
Viaarxiv icon

Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision

Add code
Nov 25, 2023
Figure 1 for Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision
Figure 2 for Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision
Figure 3 for Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision
Figure 4 for Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision
Viaarxiv icon

FinanceBench: A New Benchmark for Financial Question Answering

Add code
Nov 20, 2023
Figure 1 for FinanceBench: A New Benchmark for Financial Question Answering
Figure 2 for FinanceBench: A New Benchmark for Financial Question Answering
Figure 3 for FinanceBench: A New Benchmark for Financial Question Answering
Figure 4 for FinanceBench: A New Benchmark for Financial Question Answering
Viaarxiv icon

Anchor Points: Benchmarking Models with Much Fewer Examples

Add code
Sep 14, 2023
Viaarxiv icon