Text Detection


Text detection is the task of finding text in an image and localizing it with bounding boxes. Text can appear in any shape and size, so every instance in the image must be localized, with one bounding box per word.
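As a minimal sketch of what "one bounding box per word" can look like in practice, the snippet below converts word outlines of arbitrary shape into axis-aligned boxes. The `WordBox` type, the `polygon_to_box` helper, and the sample polygons are all hypothetical illustrations, not the output format of any particular detector.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]  # (x, y) in pixel coordinates


@dataclass
class WordBox:
    """Axis-aligned bounding box for one detected word (hypothetical type)."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def polygon_to_box(polygon: List[Point]) -> WordBox:
    """Return the smallest axis-aligned box that encloses a word outline."""
    if not polygon:
        raise ValueError("polygon must contain at least one point")
    xs = [x for x, _ in polygon]
    ys = [y for _, y in polygon]
    return WordBox(min(xs), min(ys), max(xs), max(ys))


# Made-up outlines standing in for a detector's per-word output.
word_polygons = [
    [(10, 20), (60, 18), (62, 40), (12, 42)],    # slightly rotated word
    [(70, 15), (130, 30), (125, 55), (65, 40)],  # skewed word
]

# One box per word, covering every text instance in the image.
boxes = [polygon_to_box(p) for p in word_polygons]
```

An axis-aligned box is the simplest representation; rotated or curved text is often better described by the polygon itself, which is why the sketch keeps both.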

LLM-based Semantic Augmentation for Harmful Content Detection

Apr 22, 2025

Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts

Apr 22, 2025

A Python Tool for Reconstructing Full News Text from GDELT

Apr 22, 2025

T2VShield: Model-Agnostic Jailbreak Defense for Text-to-Video Models

Apr 22, 2025

Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection

Apr 21, 2025

GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection

Apr 21, 2025

Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues

Apr 21, 2025

Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform

Apr 21, 2025

Rethinking the Potential of Multimodality in Collaborative Problem Solving Diagnosis with Large Language Models

Apr 21, 2025

Real-Time Sentiment Insights from X Using VADER, DistilBERT, and Web-Scraped Data

Apr 21, 2025