Picture for Jiasheng Ye

Jiasheng Ye

DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels

Add code
Sep 04, 2024
Viaarxiv icon

Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance

Add code
Mar 25, 2024
Viaarxiv icon

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

Add code
Feb 26, 2024
Figure 1 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Figure 2 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Figure 3 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Figure 4 for AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Viaarxiv icon

LLM can Achieve Self-Regulation via Hyperparameter Aware Generation

Add code
Feb 17, 2024
Viaarxiv icon

Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning

Add code
Aug 25, 2023
Viaarxiv icon

DINOISER: Diffused Conditional Sequence Learning by Manipulating Noises

Add code
Feb 20, 2023
Viaarxiv icon

Energy-based Unknown Intent Detection with Data Manipulation

Add code
Jul 27, 2021
Figure 1 for Energy-based Unknown Intent Detection with Data Manipulation
Figure 2 for Energy-based Unknown Intent Detection with Data Manipulation
Figure 3 for Energy-based Unknown Intent Detection with Data Manipulation
Figure 4 for Energy-based Unknown Intent Detection with Data Manipulation
Viaarxiv icon