Text Extraction From Documents


Text extraction from documents is the process of extracting text data from scanned documents or images.

Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code

Add code
Apr 24, 2025
Viaarxiv icon

Transformer-Based Extraction of Statutory Definitions from the U.S. Code

Add code
Apr 23, 2025
Viaarxiv icon

Generative AI for Research Data Processing: Lessons Learnt From Three Use Cases

Add code
Apr 22, 2025
Viaarxiv icon

Automatic Text Summarization (ATS) for Research Documents in Sorani Kurdish

Add code
Apr 20, 2025
Viaarxiv icon

Cost-Effective Text Clustering with Large Language Models

Add code
Apr 22, 2025
Viaarxiv icon

FinSage: A Multi-aspect RAG System for Financial Filings Question Answering

Add code
Apr 20, 2025
Viaarxiv icon

Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure

Add code
Apr 14, 2025
Viaarxiv icon

TWSSenti: A Novel Hybrid Framework for Topic-Wise Sentiment Analysis on Social Media Using Transformer Models

Add code
Apr 14, 2025
Viaarxiv icon

RAKG:Document-level Retrieval Augmented Knowledge Graph Construction

Add code
Apr 14, 2025
Viaarxiv icon

GUM-SAGE: A Novel Dataset and Approach for Graded Entity Salience Prediction

Add code
Apr 15, 2025
Viaarxiv icon