Picture for Daniel Sonntag

Daniel Sonntag

Finetuning Vision-Language-Action Models Requires Fewer Layers Than You Think

Add code
Jun 18, 2026
Viaarxiv icon

FOCA: Future-Oriented Conditioning for Data-Efficient Vision-Language-Action Adaptation

Add code
Jun 18, 2026
Viaarxiv icon

Self-Improving VLA Policies: Selected Diffusion Noise for Spurious-Robust Action Smoothing

Add code
Jun 12, 2026
Viaarxiv icon

The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric

Add code
Feb 13, 2026
Viaarxiv icon

How Many Tokens Do 3D Point Cloud Transformer Architectures Really Need?

Add code
Nov 07, 2025
Viaarxiv icon

S-Chain: Structured Visual Chain-of-Thought For Medicine

Add code
Oct 26, 2025
Figure 1 for S-Chain: Structured Visual Chain-of-Thought For Medicine
Figure 2 for S-Chain: Structured Visual Chain-of-Thought For Medicine
Figure 3 for S-Chain: Structured Visual Chain-of-Thought For Medicine
Figure 4 for S-Chain: Structured Visual Chain-of-Thought For Medicine
Viaarxiv icon

Mitigating Reward Over-optimization in Direct Alignment Algorithms with Importance Sampling

Add code
Jun 11, 2025
Viaarxiv icon

CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck Models

Add code
Apr 29, 2025
Figure 1 for CBM-RAG: Demonstrating Enhanced Interpretability in Radiology Report Generation with Multi-Agent RAG and Concept Bottleneck Models
Viaarxiv icon

InFL-UX: A Toolkit for Web-Based Interactive Federated Learning

Add code
Mar 06, 2025
Figure 1 for InFL-UX: A Toolkit for Web-Based Interactive Federated Learning
Figure 2 for InFL-UX: A Toolkit for Web-Based Interactive Federated Learning
Viaarxiv icon

MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification

Add code
Feb 11, 2025
Figure 1 for MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification
Figure 2 for MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification
Figure 3 for MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification
Figure 4 for MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification
Viaarxiv icon