Picture for Kshitij Gupta

Kshitij Gupta

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Add code
Mar 30, 2024
Viaarxiv icon

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Add code
Mar 26, 2024
Viaarxiv icon

Continual Pre-Training of Large Language Models: How to (re)warm your model?

Add code
Aug 08, 2023
Figure 1 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Figure 2 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Figure 3 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Figure 4 for Continual Pre-Training of Large Language Models: How to (re)warm your model?
Viaarxiv icon

ARB: Advanced Reasoning Benchmark for Large Language Models

Add code
Jul 28, 2023
Viaarxiv icon

Broken Neural Scaling Laws

Add code
Nov 10, 2022
Viaarxiv icon

Data Augmentation for Automated Essay Scoring using Transformer Models

Add code
Oct 29, 2022
Viaarxiv icon

MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation

Add code
Oct 01, 2022
Figure 1 for MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation
Figure 2 for MALM: Mixing Augmented Language Modeling for Zero-Shot Machine Translation
Viaarxiv icon

cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation

Add code
Jun 09, 2022
Figure 1 for cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
Figure 2 for cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
Figure 3 for cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
Figure 4 for cViL: Cross-Lingual Training of Vision-Language Models using Knowledge Distillation
Viaarxiv icon

Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning

Add code
May 30, 2022
Figure 1 for Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning
Figure 2 for Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning
Figure 3 for Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning
Figure 4 for Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning
Viaarxiv icon

Volta at SemEval-2021 Task 9: Statement Verification and Evidence Finding with Tables using TAPAS and Transfer Learning

Add code
Jun 17, 2021
Figure 1 for Volta at SemEval-2021 Task 9: Statement Verification and Evidence Finding with Tables using TAPAS and Transfer Learning
Figure 2 for Volta at SemEval-2021 Task 9: Statement Verification and Evidence Finding with Tables using TAPAS and Transfer Learning
Figure 3 for Volta at SemEval-2021 Task 9: Statement Verification and Evidence Finding with Tables using TAPAS and Transfer Learning
Figure 4 for Volta at SemEval-2021 Task 9: Statement Verification and Evidence Finding with Tables using TAPAS and Transfer Learning
Viaarxiv icon