Picture for Éric de la Clergerie

Éric de la Clergerie

CamemBERT 2.0: A Smarter French Language Model Aged to Perfection

Add code
Nov 13, 2024
Viaarxiv icon

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Add code
Apr 11, 2024
Viaarxiv icon

On the Scaling Laws of Geographical Representation in Language Models

Add code
Mar 04, 2024
Viaarxiv icon

Anisotropy Is Inherent to Self-Attention in Transformers

Add code
Jan 24, 2024
Viaarxiv icon

Headless Language Models: Learning without Predicting with Contrastive Weight Tying

Add code
Sep 15, 2023
Viaarxiv icon

Is Anisotropy Inherent to Transformers?

Add code
Jun 13, 2023
Viaarxiv icon

MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling

Add code
Dec 14, 2022
Viaarxiv icon

Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts

Add code
Dec 07, 2020
Figure 1 for Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Figure 2 for Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Figure 3 for Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Figure 4 for Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Viaarxiv icon

Multilingual Unsupervised Sentence Simplification

Add code
May 01, 2020
Figure 1 for Multilingual Unsupervised Sentence Simplification
Figure 2 for Multilingual Unsupervised Sentence Simplification
Figure 3 for Multilingual Unsupervised Sentence Simplification
Figure 4 for Multilingual Unsupervised Sentence Simplification
Viaarxiv icon

Controllable Sentence Simplification

Add code
Oct 16, 2019
Figure 1 for Controllable Sentence Simplification
Figure 2 for Controllable Sentence Simplification
Figure 3 for Controllable Sentence Simplification
Figure 4 for Controllable Sentence Simplification
Viaarxiv icon