Picture for Antoine Bosselut

Antoine Bosselut

Tracking the Limits of Knowledge Propagation: How LLMs Fail at Multi-Step Reasoning with Conflicting Knowledge

Add code
Jan 21, 2026
Viaarxiv icon

Revisiting Multilingual Data Mixtures in Language Model Pretraining

Add code
Oct 29, 2025
Viaarxiv icon

CAVE: Detecting and Explaining Commonsense Anomalies in Visual Environments

Add code
Oct 29, 2025
Viaarxiv icon

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments

Add code
Sep 17, 2025
Figure 1 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 2 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 3 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Figure 4 for Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
Viaarxiv icon

Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining

Add code
Sep 05, 2025
Figure 1 for Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining
Figure 2 for Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining
Figure 3 for Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining
Figure 4 for Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining
Viaarxiv icon

Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization

Add code
Aug 06, 2025
Figure 1 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Figure 2 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Figure 3 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Figure 4 for Parity-Aware Byte-Pair Encoding: Improving Cross-lingual Fairness in Tokenization
Viaarxiv icon

GeoExplorer: Active Geo-localization with Curiosity-Driven Exploration

Add code
Jul 31, 2025
Viaarxiv icon

PERK: Long-Context Reasoning as Parameter-Efficient Test-Time Learning

Add code
Jul 08, 2025
Viaarxiv icon

ConLID: Supervised Contrastive Learning for Low-Resource Language Identification

Add code
Jun 18, 2025
Figure 1 for ConLID: Supervised Contrastive Learning for Low-Resource Language Identification
Figure 2 for ConLID: Supervised Contrastive Learning for Low-Resource Language Identification
Figure 3 for ConLID: Supervised Contrastive Learning for Low-Resource Language Identification
Figure 4 for ConLID: Supervised Contrastive Learning for Low-Resource Language Identification
Viaarxiv icon

Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization

Add code
Jun 16, 2025
Figure 1 for Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
Figure 2 for Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
Figure 3 for Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
Figure 4 for Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
Viaarxiv icon