

Semantic World Models

Oct 22, 2025

olmOCR 2: Unit Test Rewards for Document OCR

Oct 22, 2025

Hubble: a Model Suite to Advance the Study of LLM Memorization

Oct 22, 2025

Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing

Oct 22, 2025

Class-Aware Prototype Learning with Negative Contrast for Test-Time Adaptation of Vision-Language Models

Oct 22, 2025

Metadata Extraction Leveraging Large Language Models

Oct 22, 2025

A Training-Free Framework for Open-Vocabulary Image Segmentation and Recognition with EfficientNet and CLIP

Oct 22, 2025

Slot Filling as a Reasoning Task for SpeechLLMs

Oct 22, 2025

Balancing Rewards in Text Summarization: Multi-Objective Reinforcement Learning via HyperVolume Optimization

Oct 22, 2025

KORE: Enhancing Knowledge Injection for Large Multimodal Models via Knowledge-Oriented Augmentations and Constraints

Oct 22, 2025