Picture for Sebastian Schelter

Sebastian Schelter

SemPiper: Interactive Code Synthesis for Semantic Operators in Machine Learning Pipelines

Add code
Jun 12, 2026
Viaarxiv icon

ArtiFact: A Large-Scale Multi-Modal Cultural Heritage Dataset

Add code
Jun 08, 2026
Viaarxiv icon

Be Fair! Can Machine Learning Engineering Agents Adhere to Fairness Constraints?

Add code
Jun 03, 2026
Viaarxiv icon

PrismaDV: Automated Task-Aware Data Unit Test Generation

Add code
Apr 23, 2026
Viaarxiv icon

ERASE -- A Real-World Aligned Benchmark for Unlearning in Recommender Systems

Add code
Mar 09, 2026
Viaarxiv icon

stratum: A System Infrastructure for Massive Agent-Centric ML Workloads

Add code
Mar 05, 2026
Viaarxiv icon

Cost-Efficient RAG for Entity Matching with LLMs: A Blocking-based Exploration

Add code
Feb 05, 2026
Viaarxiv icon

SemPipes -- Optimizable Semantic Data Operators for Tabular Machine Learning Pipelines

Add code
Feb 04, 2026
Viaarxiv icon

Towards Cross-Modal Error Detection with Tables and Images

Add code
Oct 14, 2025
Viaarxiv icon

Towards a Real-World Aligned Benchmark for Unlearning in Recommender Systems

Add code
Aug 23, 2025
Viaarxiv icon