Virginie Do

Jack

Robust LLM safeguarding via refusal feature adversarial training

Sep 30, 2024
Via arXiv

The Llama 3 Herd of Models

Jul 31, 2024
Via arXiv

Contextual bandits with concave rewards, and an application to fair ranking

Oct 18, 2022
Via arXiv

Fast online ranking with fairness of exposure

Sep 13, 2022
Via arXiv

Optimizing generalized Gini indices for fairness in rankings

Apr 14, 2022
Via arXiv

Online Approval Committee Elections

Feb 14, 2022
Via arXiv

Two-sided fairness in rankings via Lorenz dominance

Oct 28, 2021
Via arXiv

Online Selection of Diverse Committees

May 19, 2021
Via arXiv

e-ViL: A Dataset and Benchmark for Natural Language Explanations in Vision-Language Tasks

May 08, 2021
Via arXiv

Online certification of preference-based fairness for personalized recommender systems

Apr 29, 2021
Via arXiv