Anuj Kumar

North Carolina State University

SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning

Oct 02, 2025

Stream RAG: Instant and Accurate Spoken Dialogue Systems with Streaming Tool Usage

Oct 02, 2025

ConfQA: Answer Only If You Are Confident

Jun 08, 2025

Proactive Assistant Dialogue Generation from Streaming Egocentric Videos

Jun 06, 2025

VisualLens: Personalization through Visual History

Nov 25, 2024

EgoQR: Efficient QR Code Reading in Egocentric Settings

Oct 07, 2024

Doppelgänger's Watch: A Split Objective Approach to Large Language Models

Sep 09, 2024

An Overview and Comparison of Axiomatization Structures Regarding Inconsistency Indices' Properties in Pairwise Comparisons Methods

Aug 23, 2024

CRAG -- Comprehensive RAG Benchmark

Jun 07, 2024

Lumos : Empowering Multimodal LLMs with Scene Text Recognition

Feb 12, 2024