Picture for Shashank Rajput

Shashank Rajput

Maestro: Uncovering Low-Rank Structures via Trainable Decomposition

Add code
Aug 28, 2023
Viaarxiv icon

Recommender Systems with Generative Retrieval

Add code
May 08, 2023
Viaarxiv icon

The Expressive Power of Tuning Only the Norm Layers

Add code
Feb 15, 2023
Viaarxiv icon

Looped Transformers as Programmable Computers

Add code
Jan 30, 2023
Viaarxiv icon

LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks

Add code
Jun 15, 2022
Figure 1 for LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Figure 2 for LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Figure 3 for LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Figure 4 for LIFT: Language-Interfaced Fine-Tuning for Non-Language Machine Learning Tasks
Viaarxiv icon

Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment

Add code
May 23, 2022
Figure 1 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Figure 2 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Figure 3 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Figure 4 for Utilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment
Viaarxiv icon

Finding Everything within Random Binary Networks

Add code
Oct 22, 2021
Figure 1 for Finding Everything within Random Binary Networks
Figure 2 for Finding Everything within Random Binary Networks
Figure 3 for Finding Everything within Random Binary Networks
Figure 4 for Finding Everything within Random Binary Networks
Viaarxiv icon

Minibatch vs Local SGD with Shuffling: Tight Convergence Bounds and Beyond

Add code
Oct 20, 2021
Viaarxiv icon

An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks

Add code
Jun 14, 2021
Figure 1 for An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks
Figure 2 for An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks
Figure 3 for An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks
Figure 4 for An Exponential Improvement on the Memorization Capacity of Deep Threshold Networks
Viaarxiv icon

Permutation-Based SGD: Is Random Optimal?

Add code
Feb 19, 2021
Figure 1 for Permutation-Based SGD: Is Random Optimal?
Figure 2 for Permutation-Based SGD: Is Random Optimal?
Figure 3 for Permutation-Based SGD: Is Random Optimal?
Viaarxiv icon