Picture for Seongjin Shin

Seongjin Shin

Peri-LN: Revisiting Layer Normalization in the Transformer Architecture

Add code
Feb 04, 2025
Figure 1 for Peri-LN: Revisiting Layer Normalization in the Transformer Architecture
Figure 2 for Peri-LN: Revisiting Layer Normalization in the Transformer Architecture
Figure 3 for Peri-LN: Revisiting Layer Normalization in the Transformer Architecture
Figure 4 for Peri-LN: Revisiting Layer Normalization in the Transformer Architecture
Viaarxiv icon

HyperCLOVA X Technical Report

Add code
Apr 13, 2024
Viaarxiv icon

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Add code
Oct 12, 2023
Viaarxiv icon

On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model

Add code
Apr 28, 2022
Figure 1 for On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model
Figure 2 for On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model
Figure 3 for On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model
Figure 4 for On the Effect of Pretraining Corpora on In-context Learning by a Large-scale Language Model
Viaarxiv icon

Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language Understanding

Add code
Oct 25, 2020
Figure 1 for Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language Understanding
Figure 2 for Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language Understanding
Figure 3 for Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language Understanding
Figure 4 for Two-stage Textual Knowledge Distillation to Speech Encoder for Spoken Language Understanding
Viaarxiv icon