Yulia Tsvetkov

Explore Theory of Mind: Program-guided adversarial data generation for theory of mind reasoning

Dec 12, 2024

ComPO: Community Preferences for Language Model Personalization

Oct 21, 2024

Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence

Oct 15, 2024

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only

Oct 14, 2024

Biased AI can Influence Political Decision-Making

Oct 08, 2024

Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia

Oct 05, 2024

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Oct 03, 2024

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Aug 15, 2024

Know Your Limits: A Survey of Abstention in Large Language Models

Aug 08, 2024

The Art of Refusal: A Survey of Abstention in Large Language Models

Jul 25, 2024