Christoph Auer

IBM Research

Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion

Jan 27, 2025
Via arXiv

Docling Technical Report

Aug 19, 2024
Via arXiv

KVP10k : A Comprehensive Dataset for Key-Value Pair Extraction in Business Documents

May 01, 2024
Via arXiv

ESG Accountability Made Easy: DocQA at Your Service

Nov 30, 2023
Via arXiv

ICDAR 2023 Competition on Robust Layout Segmentation in Corporate Documents

May 24, 2023
Via arXiv

Optimized Table Tokenization for Table Structure Recognition

May 05, 2023
Via arXiv

FETA: Towards Specializing Foundation Models for Expert Task Applications

Sep 08, 2022
Via arXiv

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

Jun 02, 2022
Via arXiv

Delivering Document Conversion as a Cloud Service with High Throughput and Responsiveness

Jun 01, 2022
Via arXiv

Robust PDF Document Conversion Using Recurrent Neural Networks

Feb 18, 2021
Via arXiv