Picture for Yiming Ju

Yiming Ju

Training Data for Large Language Model

Add code
Nov 12, 2024
Viaarxiv icon

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

Add code
Sep 11, 2024
Viaarxiv icon

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

Add code
Aug 13, 2024
Viaarxiv icon

SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking Mechanisms

Add code
Jun 05, 2024
Viaarxiv icon

KLoB: a Benchmark for Assessing Knowledge Locating Methods in Language Models

Add code
Sep 28, 2023
Viaarxiv icon

Unsupervised Text Style Transfer with Deep Generative Models

Add code
Aug 31, 2023
Viaarxiv icon

Generating Hierarchical Explanations on Text Classification Without Connecting Rules

Add code
Oct 24, 2022
Viaarxiv icon

The Logic Traps in Evaluating Post-hoc Interpretations

Add code
Sep 12, 2021
Figure 1 for The Logic Traps in Evaluating Post-hoc Interpretations
Viaarxiv icon