Picture for Pranav Shetty

Pranav Shetty

Detecting Non-Membership in LLM Training Data via Rank Correlations

Add code
Mar 24, 2026
Viaarxiv icon

ExStrucTiny: A Benchmark for Schema-Variable Structured Information Extraction from Document Images

Add code
Feb 12, 2026
Viaarxiv icon

Perturb Your Data: Paraphrase-Guided Training Data Watermarking

Add code
Dec 18, 2025
Figure 1 for Perturb Your Data: Paraphrase-Guided Training Data Watermarking
Figure 2 for Perturb Your Data: Paraphrase-Guided Training Data Watermarking
Figure 3 for Perturb Your Data: Paraphrase-Guided Training Data Watermarking
Figure 4 for Perturb Your Data: Paraphrase-Guided Training Data Watermarking
Viaarxiv icon

CoCoLex: Confidence-guided Copy-based Decoding for Grounded Legal Text Generation

Add code
Aug 07, 2025
Viaarxiv icon

Where is this coming from? Making groundedness count in the evaluation of Document VQA models

Add code
Mar 24, 2025
Viaarxiv icon

"What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs

Add code
Oct 20, 2024
Figure 1 for "What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs
Figure 2 for "What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs
Figure 3 for "What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs
Figure 4 for "What is the value of {templates}?" Rethinking Document Information Extraction Datasets for LLMs
Viaarxiv icon

Accelerating materials discovery for polymer solar cells: Data-driven insights enabled by natural language processing

Add code
Feb 29, 2024
Figure 1 for Accelerating materials discovery for polymer solar cells: Data-driven insights enabled by natural language processing
Figure 2 for Accelerating materials discovery for polymer solar cells: Data-driven insights enabled by natural language processing
Figure 3 for Accelerating materials discovery for polymer solar cells: Data-driven insights enabled by natural language processing
Figure 4 for Accelerating materials discovery for polymer solar cells: Data-driven insights enabled by natural language processing
Viaarxiv icon

PolyIE: A Dataset of Information Extraction from Polymer Material Scientific Literature

Add code
Nov 13, 2023
Figure 1 for PolyIE: A Dataset of Information Extraction from Polymer Material Scientific Literature
Figure 2 for PolyIE: A Dataset of Information Extraction from Polymer Material Scientific Literature
Figure 3 for PolyIE: A Dataset of Information Extraction from Polymer Material Scientific Literature
Figure 4 for PolyIE: A Dataset of Information Extraction from Polymer Material Scientific Literature
Viaarxiv icon

Cross-Geography Generalization of Machine Learning Methods for Classification of Flooded Regions in Aerial Images

Add code
Oct 04, 2022
Figure 1 for Cross-Geography Generalization of Machine Learning Methods for Classification of Flooded Regions in Aerial Images
Figure 2 for Cross-Geography Generalization of Machine Learning Methods for Classification of Flooded Regions in Aerial Images
Figure 3 for Cross-Geography Generalization of Machine Learning Methods for Classification of Flooded Regions in Aerial Images
Figure 4 for Cross-Geography Generalization of Machine Learning Methods for Classification of Flooded Regions in Aerial Images
Viaarxiv icon

A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language Processing

Add code
Sep 27, 2022
Figure 1 for A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language Processing
Figure 2 for A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language Processing
Figure 3 for A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language Processing
Figure 4 for A general-purpose material property data extraction pipeline from large polymer corpora using Natural Language Processing
Viaarxiv icon