Picture for Marianna Nezhurina

Marianna Nezhurina

DataComp-LM: In search of the next generation of training sets for language models

Add code
Jun 18, 2024
Figure 1 for DataComp-LM: In search of the next generation of training sets for language models
Figure 2 for DataComp-LM: In search of the next generation of training sets for language models
Figure 3 for DataComp-LM: In search of the next generation of training sets for language models
Figure 4 for DataComp-LM: In search of the next generation of training sets for language models
Viaarxiv icon

Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models

Add code
Jun 04, 2024
Viaarxiv icon

Language models scale reliably with over-training and on downstream tasks

Add code
Mar 13, 2024
Figure 1 for Language models scale reliably with over-training and on downstream tasks
Figure 2 for Language models scale reliably with over-training and on downstream tasks
Figure 3 for Language models scale reliably with over-training and on downstream tasks
Figure 4 for Language models scale reliably with over-training and on downstream tasks
Viaarxiv icon

MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies

Add code
Aug 03, 2023
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

Add code
Jun 30, 2022
Figure 1 for BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
Figure 2 for BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
Figure 3 for BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
Figure 4 for BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing
Viaarxiv icon