Moin Nabi

Multimodal Autoregressive Pre-training of Large Vision Encoders

Nov 21, 2024
Via arXiv

SALSA: Soup-based Alignment Learning for Stronger Adaptation in RLHF

Nov 04, 2024
Via arXiv

Computational Bottlenecks of Training Small-scale Large Language Models

Oct 25, 2024
Via arXiv

Visual Scratchpads: Enabling Global Reasoning in Vision

Oct 10, 2024
Via arXiv

KV Prediction for Improved Time to First Token

Oct 10, 2024
Via arXiv

Contrastive Perplexity for Controlled Generation: An Application in Detoxifying Large Language Models

Jan 24, 2024
Via arXiv

Semi-supervised learning made simple with self-supervised clustering

Jun 13, 2023
Via arXiv

A soft nearest-neighbor framework for continual semi-supervised learning

Dec 09, 2022
Via arXiv

miCSE: Mutual Information Contrastive Learning for Low-shot Sentence Embeddings

Nov 09, 2022
Via arXiv

Mixture-of-experts VAEs can disregard variation in surjective multimodal data

Apr 11, 2022
Via arXiv