Vaibhav Singh

Chow-Liu Ordering for Long-Context Reasoning in Chain-of-Agents

Mar 10, 2026

Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away

Feb 11, 2026

When Data Falls Short: Grokking Below the Critical Threshold

Nov 06, 2025

KITE: Kernelized and Information Theoretic Exemplars for In-Context Learning

Sep 19, 2025

Beyond Cosine Decay: On the effectiveness of Infinite Learning Rate Schedule for Continual Pre-training

Mar 06, 2025

ARISE: Iterative Rule Induction and Synthetic Data Generation for Text Classification

Feb 09, 2025

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

Nov 27, 2024

Machine learning approaches for automatic defect detection in photovoltaic systems

Sep 24, 2024

A Three-Pronged Approach to Cross-Lingual Adaptation with Multilingual LLMs

Jun 25, 2024

Controlling Forgetting with Test-Time Data in Continual Learning

Jun 19, 2024