Document Layout Analysis


Document layout analysis (DLA) is the process of analyzing the spatial arrangement of a document's content to recover its structure. It locates text, tables, images, and other elements on the page and identifies the overall hierarchy, such as headings and subheadings. DLA supports extracting and categorizing information and automating document processing workflows.
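As a rough illustration of what a DLA result might look like, the sketch below represents each detected element as a labeled bounding box and sorts the boxes into a naive reading order. The names `Region`, `reading_order`, and `group_by_label` are hypothetical and the coordinates are made up. The top-to-bottom, left-to-right ordering assumes a single-column page and is not taken from any of the papers listed here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """One detected layout element (hypothetical structure for illustration)."""
    label: str   # e.g. "heading", "text", "table", "image"
    x0: float    # left edge
    y0: float    # top edge
    x1: float    # right edge
    y1: float    # bottom edge


def reading_order(regions):
    """Sort regions top-to-bottom, then left-to-right.

    Assumes a single-column page; multi-column layouts need a smarter rule.
    """
    return sorted(regions, key=lambda r: (r.y0, r.x0))


def group_by_label(regions):
    """Collect regions by element type, e.g. all tables on the page."""
    groups = {}
    for region in regions:
        groups.setdefault(region.label, []).append(region)
    return groups


# Made-up detections for one page, deliberately out of order.
page = [
    Region("text", 50, 120, 550, 300),
    Region("heading", 50, 40, 550, 90),
    Region("table", 50, 320, 550, 500),
    Region("image", 50, 520, 300, 700),
]

ordered_labels = [r.label for r in reading_order(page)]
tables = group_by_label(page)["table"]
```

Here `ordered_labels` lists the element types as a reader would encounter them, and `tables` holds the table regions that a downstream extraction step could process.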

DIMT25@ICDAR2025: HW-TSC's End-to-End Document Image Machine Translation System Leveraging Large Vision-Language Model

Apr 24, 2025

DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning

Apr 05, 2025

SFDLA: Source-Free Document Layout Analysis

Mar 24, 2025

AnnoPage Dataset: Dataset of Non-Textual Elements in Documents with Fine-Grained Categorization

Mar 28, 2025

PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction

Mar 21, 2025

UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure Analysis

Mar 20, 2025

An Efficient Deep Learning-Based Approach to Automating Invoice Document Validation

Mar 15, 2025

TextBite: A Historical Czech Document Dataset for Logical Page Segmentation

Mar 20, 2025

MarkushGrapher: Joint Visual and Textual Recognition of Markush Structures

Mar 20, 2025

EDocNet: Efficient Datasheet Layout Analysis Based on Focus and Global Knowledge Distillation

Feb 23, 2025