Picture for Ariel Gera

Ariel Gera

Flash-GMM: A Memory-Efficient Kernel for Scalable Soft Clustering

Add code
Jun 09, 2026
Viaarxiv icon

Teaching Values to Machines: Simulating Human-Like Behavior in LLMs

Add code
May 28, 2026
Viaarxiv icon

Task-Adaptive Embedding Refinement via Test-time LLM Guidance

Add code
May 12, 2026
Viaarxiv icon

Robustness as an Emergent Property of Task Performance

Add code
Feb 03, 2026
Viaarxiv icon

Guided Query Refinement: Multimodal Hybrid Retrieval with Test-Time Optimization

Add code
Oct 06, 2025
Figure 1 for Guided Query Refinement: Multimodal Hybrid Retrieval with Test-Time Optimization
Figure 2 for Guided Query Refinement: Multimodal Hybrid Retrieval with Test-Time Optimization
Figure 3 for Guided Query Refinement: Multimodal Hybrid Retrieval with Test-Time Optimization
Figure 4 for Guided Query Refinement: Multimodal Hybrid Retrieval with Test-Time Optimization
Viaarxiv icon

Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation

Add code
Jun 05, 2025
Viaarxiv icon

An Analysis of Hyper-Parameter Optimization Methods for Retrieval Augmented Generation

Add code
May 06, 2025
Viaarxiv icon

WildIFEval: Instruction Following in the Wild

Add code
Mar 09, 2025
Figure 1 for WildIFEval: Instruction Following in the Wild
Figure 2 for WildIFEval: Instruction Following in the Wild
Figure 3 for WildIFEval: Instruction Following in the Wild
Figure 4 for WildIFEval: Instruction Following in the Wild
Viaarxiv icon

The Mighty ToRR: A Benchmark for Table Reasoning and Robustness

Add code
Feb 26, 2025
Figure 1 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Figure 2 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Figure 3 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Figure 4 for The Mighty ToRR: A Benchmark for Table Reasoning and Robustness
Viaarxiv icon

JuStRank: Benchmarking LLM Judges for System Ranking

Add code
Dec 12, 2024
Figure 1 for JuStRank: Benchmarking LLM Judges for System Ranking
Figure 2 for JuStRank: Benchmarking LLM Judges for System Ranking
Figure 3 for JuStRank: Benchmarking LLM Judges for System Ranking
Figure 4 for JuStRank: Benchmarking LLM Judges for System Ranking
Viaarxiv icon