Kiril Gashteovski

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Mar 25, 2025

Not-Just-Scaling Laws: Towards a Better Understanding of the Downstream Impact of Language Model Design Decisions

Mar 05, 2025

MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis

Feb 26, 2025

Evaluating Language Models as Synthetic Data Generators

Dec 04, 2024

Aligning Generalisation Between Humans and Machines

Nov 23, 2024

LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document Summarization

Jun 18, 2024

AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents

Apr 09, 2024

Robust Text Classification: Analyzing Prototype-Based Networks

Nov 11, 2023

Linking Surface Facts to Large-Scale Knowledge Graphs

Oct 23, 2023

Large Language Models Enable Few-Shot Clustering

Jul 02, 2023