Picture for Steffen Eger

Steffen Eger

DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?

Add code
Apr 10, 2025
Viaarxiv icon

ContrastScore: Towards Higher Quality, Less Biased, More Efficient Evaluation Metrics with Contrastive Evaluation

Add code
Apr 02, 2025
Viaarxiv icon

TikZero: Zero-Shot Text-Guided Graphics Program Synthesis

Add code
Mar 14, 2025
Viaarxiv icon

BatchGEMBA: Token-Efficient Machine Translation Evaluation with Batched Prompting and Prompt Compression

Add code
Mar 04, 2025
Viaarxiv icon

Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation

Add code
Feb 07, 2025
Viaarxiv icon

PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics

Add code
Dec 20, 2024
Viaarxiv icon

Graph-Guided Textual Explanation Generation Framework

Add code
Dec 16, 2024
Figure 1 for Graph-Guided Textual Explanation Generation Framework
Figure 2 for Graph-Guided Textual Explanation Generation Framework
Figure 3 for Graph-Guided Textual Explanation Generation Framework
Figure 4 for Graph-Guided Textual Explanation Generation Framework
Viaarxiv icon

ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation?

Add code
Dec 03, 2024
Figure 1 for ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation?
Figure 2 for ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation?
Figure 3 for ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation?
Figure 4 for ScImage: How Good Are Multimodal Large Language Models at Scientific Text-to-Image Generation?
Viaarxiv icon

How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs

Add code
Oct 24, 2024
Figure 1 for How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs
Figure 2 for How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs
Figure 3 for How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs
Figure 4 for How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs
Viaarxiv icon

LLM-based multi-agent poetry generation in non-cooperative environments

Add code
Sep 05, 2024
Figure 1 for LLM-based multi-agent poetry generation in non-cooperative environments
Figure 2 for LLM-based multi-agent poetry generation in non-cooperative environments
Figure 3 for LLM-based multi-agent poetry generation in non-cooperative environments
Figure 4 for LLM-based multi-agent poetry generation in non-cooperative environments
Viaarxiv icon