Picture for Jonathan Mallinson

Jonathan Mallinson

Offline Regularised Reinforcement Learning for Large Language Models Alignment

Add code
May 29, 2024
Viaarxiv icon

West-of-N: Synthetic Preference Generation for Improved Reward Modeling

Add code
Jan 22, 2024
Viaarxiv icon

Small Language Models Improve Giants by Rewriting Their Outputs

Add code
May 22, 2023
Figure 1 for Small Language Models Improve Giants by Rewriting Their Outputs
Figure 2 for Small Language Models Improve Giants by Rewriting Their Outputs
Figure 3 for Small Language Models Improve Giants by Rewriting Their Outputs
Figure 4 for Small Language Models Improve Giants by Rewriting Their Outputs
Viaarxiv icon

Teaching Small Language Models to Reason

Add code
Dec 19, 2022
Viaarxiv icon

Text Generation with Text-Editing Models

Add code
Jun 14, 2022
Figure 1 for Text Generation with Text-Editing Models
Figure 2 for Text Generation with Text-Editing Models
Figure 3 for Text Generation with Text-Editing Models
Figure 4 for Text Generation with Text-Editing Models
Viaarxiv icon

EdiT5: Semi-Autoregressive Text-Editing with T5 Warm-Start

Add code
May 24, 2022
Figure 1 for EdiT5: Semi-Autoregressive Text-Editing with T5 Warm-Start
Figure 2 for EdiT5: Semi-Autoregressive Text-Editing with T5 Warm-Start
Figure 3 for EdiT5: Semi-Autoregressive Text-Editing with T5 Warm-Start
Figure 4 for EdiT5: Semi-Autoregressive Text-Editing with T5 Warm-Start
Viaarxiv icon

RED-ACE: Robust Error Detection for ASR using Confidence Embeddings

Add code
Mar 14, 2022
Figure 1 for RED-ACE: Robust Error Detection for ASR using Confidence Embeddings
Figure 2 for RED-ACE: Robust Error Detection for ASR using Confidence Embeddings
Figure 3 for RED-ACE: Robust Error Detection for ASR using Confidence Embeddings
Figure 4 for RED-ACE: Robust Error Detection for ASR using Confidence Embeddings
Viaarxiv icon

A Simple Recipe for Multilingual Grammatical Error Correction

Add code
Jun 07, 2021
Figure 1 for A Simple Recipe for Multilingual Grammatical Error Correction
Figure 2 for A Simple Recipe for Multilingual Grammatical Error Correction
Figure 3 for A Simple Recipe for Multilingual Grammatical Error Correction
Figure 4 for A Simple Recipe for Multilingual Grammatical Error Correction
Viaarxiv icon

Felix: Flexible Text Editing Through Tagging and Insertion

Add code
Mar 24, 2020
Figure 1 for Felix: Flexible Text Editing Through Tagging and Insertion
Figure 2 for Felix: Flexible Text Editing Through Tagging and Insertion
Figure 3 for Felix: Flexible Text Editing Through Tagging and Insertion
Figure 4 for Felix: Flexible Text Editing Through Tagging and Insertion
Viaarxiv icon

Controllable Sentence Simplification: Employing Syntactic and Lexical Constraints

Add code
Oct 10, 2019
Figure 1 for Controllable Sentence Simplification: Employing Syntactic and Lexical Constraints
Figure 2 for Controllable Sentence Simplification: Employing Syntactic and Lexical Constraints
Figure 3 for Controllable Sentence Simplification: Employing Syntactic and Lexical Constraints
Figure 4 for Controllable Sentence Simplification: Employing Syntactic and Lexical Constraints
Viaarxiv icon