Picture for Aashu Singh

Aashu Singh

Think Then Embed: Generative Context Improves Multimodal Embedding

Add code
Oct 06, 2025
Viaarxiv icon

RESTRAIN: From Spurious Votes to Signals -- Self-Driven RL with Self-Penalization

Add code
Oct 02, 2025
Viaarxiv icon

Optimizing Recall or Relevance? A Multi-Task Multi-Head Approach for Item-to-Item Retrieval in Recommendation

Add code
Jun 06, 2025
Viaarxiv icon

Transfer between Modalities with MetaQueries

Add code
Apr 08, 2025
Viaarxiv icon

CompCap: Improving Multimodal Large Language Models with Composite Captions

Add code
Dec 06, 2024
Figure 1 for CompCap: Improving Multimodal Large Language Models with Composite Captions
Figure 2 for CompCap: Improving Multimodal Large Language Models with Composite Captions
Figure 3 for CompCap: Improving Multimodal Large Language Models with Composite Captions
Figure 4 for CompCap: Improving Multimodal Large Language Models with Composite Captions
Viaarxiv icon