Picture for Benjamin Feuer

Benjamin Feuer

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Add code
Jan 30, 2025
Viaarxiv icon

Hidden in the Noise: Two-Stage Robust Watermarking for Images

Add code
Dec 05, 2024
Viaarxiv icon

SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification

Add code
Oct 07, 2024
Figure 1 for SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification
Figure 2 for SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification
Figure 3 for SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification
Figure 4 for SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification
Viaarxiv icon

Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity

Add code
Jun 25, 2024
Figure 1 for Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity
Figure 2 for Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity
Figure 3 for Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity
Figure 4 for Arboretum: A Large Multimodal Dataset Enabling AI for Biodiversity
Viaarxiv icon

TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks

Add code
Feb 17, 2024
Figure 1 for TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks
Figure 2 for TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks
Figure 3 for TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks
Figure 4 for TuneTables: Context Optimization for Scalable Prior-Data Fitted Networks
Viaarxiv icon

Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks

Add code
Nov 17, 2023
Viaarxiv icon

Exploring Dataset-Scale Indicators of Data Quality

Add code
Nov 07, 2023
Figure 1 for Exploring Dataset-Scale Indicators of Data Quality
Figure 2 for Exploring Dataset-Scale Indicators of Data Quality
Figure 3 for Exploring Dataset-Scale Indicators of Data Quality
Figure 4 for Exploring Dataset-Scale Indicators of Data Quality
Viaarxiv icon

ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models

Add code
Nov 06, 2023
Figure 1 for ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
Figure 2 for ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
Figure 3 for ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
Figure 4 for ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models
Viaarxiv icon

Distributionally Robust Classification on a Data Budget

Add code
Aug 07, 2023
Figure 1 for Distributionally Robust Classification on a Data Budget
Figure 2 for Distributionally Robust Classification on a Data Budget
Figure 3 for Distributionally Robust Classification on a Data Budget
Figure 4 for Distributionally Robust Classification on a Data Budget
Viaarxiv icon

LiT Tuned Models for Efficient Species Detection

Add code
Feb 12, 2023
Viaarxiv icon