Picture for Jacob Carlson

Jacob Carlson

EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge

Add code
Oct 16, 2023
Viaarxiv icon

American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers

Add code
Aug 24, 2023
Figure 1 for American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Figure 2 for American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Figure 3 for American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Figure 4 for American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Viaarxiv icon

Efficient OCR for Building a Diverse Digital History

Add code
Apr 05, 2023
Viaarxiv icon

LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis

Add code
Mar 29, 2021
Figure 1 for LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
Figure 2 for LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
Figure 3 for LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
Figure 4 for LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
Viaarxiv icon