Picture for Raquel Fernández

Raquel Fernández

Institute for Logic, Language & Computation, University of Amsterdam

Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests

Add code
Feb 20, 2025
Viaarxiv icon

Natural Language Generation from Visual Sequences: Challenges and Future Directions

Add code
Feb 18, 2025
Viaarxiv icon

RACQUET: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs

Add code
Dec 18, 2024
Figure 1 for RACQUET: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs
Figure 2 for RACQUET: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs
Figure 3 for RACQUET: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs
Figure 4 for RACQUET: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs
Viaarxiv icon

Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation

Add code
Dec 18, 2024
Viaarxiv icon

Modelling Multimodal Integration in Human Concept Processing with Vision-and-Language Models

Add code
Jul 25, 2024
Viaarxiv icon

Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition

Add code
Jul 05, 2024
Viaarxiv icon

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

Add code
Jun 26, 2024
Figure 1 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 2 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 3 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 4 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Viaarxiv icon

Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

Add code
Jun 19, 2024
Figure 1 for Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation
Figure 2 for Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation
Figure 3 for Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation
Figure 4 for Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation
Viaarxiv icon

MBBQ: A Dataset for Cross-Lingual Comparison of Stereotypes in Generative LLMs

Add code
Jun 11, 2024
Viaarxiv icon

Analysing Cross-Speaker Convergence in Face-to-Face Dialogue through the Lens of Automatically Detected Shared Linguistic Constructions

Add code
May 14, 2024
Viaarxiv icon