Picture for Ganesh Jawahar

Ganesh Jawahar

LLM Performance Predictors are good initializers for Architecture Search

Add code
Oct 25, 2023
Viaarxiv icon

Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts

Add code
Jun 08, 2023
Viaarxiv icon

Orca: Progressive Learning from Complex Explanation Traces of GPT-4

Add code
Jun 05, 2023
Viaarxiv icon

AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers

Add code
Oct 14, 2022
Figure 1 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Figure 2 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Figure 3 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Figure 4 for AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers
Viaarxiv icon

Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints

Add code
Oct 06, 2022
Figure 1 for Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints
Figure 2 for Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints
Figure 3 for Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints
Figure 4 for Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints
Viaarxiv icon

Automatic Detection of Entity-Manipulated Text using Factual Knowledge

Add code
Mar 19, 2022
Figure 1 for Automatic Detection of Entity-Manipulated Text using Factual Knowledge
Figure 2 for Automatic Detection of Entity-Manipulated Text using Factual Knowledge
Figure 3 for Automatic Detection of Entity-Manipulated Text using Factual Knowledge
Figure 4 for Automatic Detection of Entity-Manipulated Text using Factual Knowledge
Viaarxiv icon

InfoDCL: A Distantly Supervised Contrastive Learning Framework for Social Meaning

Add code
Mar 15, 2022
Figure 1 for InfoDCL: A Distantly Supervised Contrastive Learning Framework for Social Meaning
Figure 2 for InfoDCL: A Distantly Supervised Contrastive Learning Framework for Social Meaning
Figure 3 for InfoDCL: A Distantly Supervised Contrastive Learning Framework for Social Meaning
Figure 4 for InfoDCL: A Distantly Supervised Contrastive Learning Framework for Social Meaning
Viaarxiv icon

Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora

Add code
Dec 28, 2021
Figure 1 for Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora
Figure 2 for Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora
Figure 3 for Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora
Figure 4 for Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora
Viaarxiv icon

Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing

Add code
May 18, 2021
Figure 1 for Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Figure 2 for Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Figure 3 for Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Figure 4 for Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
Viaarxiv icon

Automatic Detection of Machine Generated Text: A Critical Survey

Add code
Nov 02, 2020
Figure 1 for Automatic Detection of Machine Generated Text: A Critical Survey
Figure 2 for Automatic Detection of Machine Generated Text: A Critical Survey
Figure 3 for Automatic Detection of Machine Generated Text: A Critical Survey
Viaarxiv icon