Rasna Goyal

Learning from Rules Generalizing Labeled Exemplars

May 15, 2020

Abhijeet Awasthi, Sabyasachi Ghosh, Rasna Goyal, Sunita Sarawagi

Abstract: In many applications, labeled data is not readily available and must be collected through painstaking human supervision. We propose a rule-exemplar method for collecting human supervision that combines the efficiency of rules with the quality of instance labels. The supervision is coupled such that it is both natural for humans and synergistic for learning. We propose a training algorithm that jointly denoises rules via latent coverage variables and trains the model through a soft implication loss over the coverage and label variables. The denoised rules and trained model are used jointly for inference. Empirical evaluation on five different tasks shows that (1) our algorithm is more accurate than several existing methods of learning from a mix of clean and noisy supervision, and (2) the coupled rule-exemplar supervision is effective in denoising rules.

* ICLR 2020 (Spotlight)
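
The abstract above mentions a soft implication loss over coverage and label variables but does not give its formula. As a rough illustration only, the sketch below relaxes the logical statement "if the rule covers this instance, then the label is the rule's label" into a differentiable penalty. The function name, its arguments, and the exact functional form are assumptions made for this example, not details taken from the listing.

```python
import math


def soft_implication_loss(p_covers: float, p_label: float, eps: float = 1e-12) -> float:
    """Penalty for violating 'rule covers x  =>  y equals the rule's label'.

    p_covers: assumed probability that the (denoised) rule truly covers x.
    p_label:  assumed model probability that y is the label the rule assigns.
    The implication is violated when the rule covers x but the label differs,
    which has soft probability p_covers * (1 - p_label).
    """
    violation = p_covers * (1.0 - p_label)
    # Negative log-probability that the implication holds; eps guards log(0).
    return -math.log(max(1.0 - violation, eps))


if __name__ == "__main__":
    # A rule believed to cover x while the model disagrees is penalised more
    # than a rule whose coverage has been down-weighted.
    print(soft_implication_loss(0.9, 0.1))
    print(soft_implication_loss(0.1, 0.1))
```

Under this assumed form, the penalty can be reduced either by raising the label probability or by lowering the coverage probability, which is one way a noisy rule could be "switched off" on instances where it misfires.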

Parallel Iterative Edit Models for Local Sequence Transduction

Oct 07, 2019

Abhijeet Awasthi, Sunita Sarawagi, Rasna Goyal, Sabyasachi Ghosh, Vihari Piratla

Abstract: We present a Parallel Iterative Edit (PIE) model for the problem of local sequence transduction arising in tasks like grammatical error correction (GEC). Recent approaches are based on the popular encoder-decoder (ED) model for sequence-to-sequence learning. The ED model auto-regressively captures full dependency among output tokens but is slow due to sequential decoding. The PIE model does parallel decoding, giving up the advantage of modelling full dependency in the output, yet it achieves accuracy competitive with the ED model for four reasons: (1) predicting edits instead of tokens, (2) labeling sequences instead of generating sequences, (3) iteratively refining predictions to capture dependencies, and (4) factorizing logits over edits and their token argument to harness pre-trained language models like BERT. Experiments on tasks spanning GEC, OCR correction and spell correction demonstrate that the PIE model is an accurate and significantly faster alternative for local sequence transduction.

* Accepted at EMNLP-IJCNLP 2019
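
To make "predicting edits instead of tokens" and "labeling sequences instead of generating sequences" from the abstract above more concrete, here is a minimal sketch that turns a source/target sentence pair into one edit label per source token, then applies those labels to recover the target. The edit vocabulary (`COPY`, `DELETE`, `REPLACE`, plus appended tokens), the `<s>` sentinel, and both function names are assumptions for this illustration; the alignment uses Python's standard `difflib` and is not claimed to be the paper's procedure.

```python
from difflib import SequenceMatcher

START = "<s>"  # hypothetical sentinel so insertions at position 0 have a host token


def to_edit_labels(src, tgt):
    """Return one edit label per token of [START] + src that rewrites src into tgt."""
    src = [START] + list(src)
    tgt = [START] + list(tgt)
    labels = [{"op": "COPY", "arg": None, "append": []} for _ in src]
    matcher = SequenceMatcher(a=src, b=tgt, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "delete":
            for i in range(i1, i2):
                labels[i]["op"] = "DELETE"
        elif tag == "insert":
            # Attach inserted tokens to the source token just before the gap.
            labels[i1 - 1]["append"].extend(tgt[j1:j2])
        elif tag == "replace":
            # Pair tokens one-to-one; delete surplus source tokens and
            # append surplus target tokens after the last paired token.
            n = min(i2 - i1, j2 - j1)
            for k in range(n):
                labels[i1 + k]["op"] = "REPLACE"
                labels[i1 + k]["arg"] = tgt[j1 + k]
            for i in range(i1 + n, i2):
                labels[i]["op"] = "DELETE"
            labels[i1 + n - 1]["append"].extend(tgt[j1 + n:j2])
    return labels


def apply_edits(src, labels):
    """Apply per-token edit labels; each position is handled independently."""
    out = []
    for token, label in zip([START] + list(src), labels):
        if label["op"] == "COPY":
            out.append(token)
        elif label["op"] == "REPLACE":
            out.append(label["arg"])
        # DELETE emits nothing for this position.
        out.extend(label["append"])
    return out[1:]  # drop the sentinel


if __name__ == "__main__":
    source = "he go to school yesterday".split()
    target = "he went to the school yesterday".split()
    labels = to_edit_labels(source, target)
    print(labels)
    print(apply_edits(source, labels))
```

Because every source position carries its own label, a tagger could in principle predict all labels at once rather than token by token. Iterative refinement would then amount to feeding the output of `apply_edits` back in as the next source.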
