Jiajun Shen

Citekit: A Modular Toolkit for Large Language Model Citation Generation

Aug 06, 2024
Via arXiv

DiPaCo: Distributed Path Composition

Mar 15, 2024
Via arXiv

Exploring Federated Self-Supervised Learning for General Purpose Audio Understanding

Feb 05, 2024
Via arXiv

Asynchronous Local-SGD Training for Language Modeling

Jan 17, 2024
Via arXiv

A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames

Dec 12, 2023
Via arXiv

DiLoCo: Distributed Low-Communication Training of Language Models

Nov 14, 2023
Via arXiv

Adaptive Uncertainty Estimation via High-Dimensional Testing on Latent Representations

Oct 25, 2023
Via arXiv

L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised Visual Representation Learning

Jul 14, 2023
Via arXiv

Source-Aware Embedding Training on Heterogeneous Information Networks

Jul 10, 2023
Via arXiv

Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime

May 03, 2023
Via arXiv