Picture for Tom Bryan

Tom Bryan

News Deja Vu: Connecting Past and Present with Semantic Search

Add code
Jun 21, 2024
Figure 1 for News Deja Vu: Connecting Past and Present with Semantic Search
Figure 2 for News Deja Vu: Connecting Past and Present with Semantic Search
Figure 3 for News Deja Vu: Connecting Past and Present with Semantic Search
Figure 4 for News Deja Vu: Connecting Past and Present with Semantic Search
Viaarxiv icon

EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge

Add code
Oct 16, 2023
Viaarxiv icon

American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers

Add code
Aug 24, 2023
Figure 1 for American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Figure 2 for American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Figure 3 for American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Figure 4 for American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Viaarxiv icon

Efficient OCR for Building a Diverse Digital History

Add code
Apr 05, 2023
Viaarxiv icon