Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Beicheng Lou

Wormhole MAML: Meta-Learning in Glued Parameter Space

Dec 28, 2022

Chih-Jung Tracy Chang, Yuan Gao, Beicheng Lou

Figure 1 for Wormhole MAML: Meta-Learning in Glued Parameter Space

Figure 2 for Wormhole MAML: Meta-Learning in Glued Parameter Space

Figure 3 for Wormhole MAML: Meta-Learning in Glued Parameter Space

Figure 4 for Wormhole MAML: Meta-Learning in Glued Parameter Space

Abstract:In this paper, we introduce a novel variation of model-agnostic meta-learning, where an extra multiplicative parameter is introduced in the inner-loop adaptation. Our variation creates a shortcut in the parameter space for the inner-loop adaptation and increases model expressivity in a highly controllable manner. We show both theoretically and numerically that our variation alleviates the problem of conflicting gradients and improves training dynamics. We conduct experiments on 3 distinctive problems, including a toy classification problem for threshold comparison, a regression problem for wavelet transform, and a classification problem on MNIST. We also discuss ways to generalize our method to a broader class of problems.

Via

Access Paper or Ask Questions

Saved You A Click: Automatically Answering Clickbait Titles

Dec 15, 2022

Oliver Johnson, Beicheng Lou, Janet Zhong, Andrey Kurenkov

Abstract:Often clickbait articles have a title that is phrased as a question or vague teaser that entices the user to click on the link and read the article to find the explanation. We developed a system that will automatically find the answer or explanation of the clickbait hook from the website text so that the user does not need to read through the text themselves. We fine-tune an extractive question and answering model (RoBERTa) and an abstractive one (T5), using data scraped from the 'StopClickbait' Facebook pages and Reddit's 'SavedYouAClick' subforum. We find that both extractive and abstractive models improve significantly after finetuning. We find that the extractive model performs slightly better according to ROUGE scores, while the abstractive one has a slight edge in terms of BERTscores.

Via

Access Paper or Ask Questions

Compressed imitation learning

Sep 18, 2020

Nathan Zhao, Beicheng Lou

Figure 1 for Compressed imitation learning

Figure 2 for Compressed imitation learning

Figure 3 for Compressed imitation learning

Figure 4 for Compressed imitation learning

Abstract:In analogy to compressed sensing, which allows sample-efficient signal reconstruction given prior knowledge of its sparsity in frequency domain, we propose to utilize policy simplicity (Occam's Razor) as a prior to enable sample-efficient imitation learning. We first demonstrated the feasibility of this scheme on linear case where state-value function can be sampled directly. We also extended the scheme to scenarios where only actions are visible and scenarios where the policy is obtained from nonlinear network. The method is benchmarked against behavior cloning and results in significantly higher scores with limited expert demonstrations.

Via

Access Paper or Ask Questions

TunaGAN: Interpretable GAN for Smart Editing

Aug 16, 2019

Weiquan Mao, Beicheng Lou, Jiyao Yuan

Figure 1 for TunaGAN: Interpretable GAN for Smart Editing

Figure 2 for TunaGAN: Interpretable GAN for Smart Editing

Figure 3 for TunaGAN: Interpretable GAN for Smart Editing

Figure 4 for TunaGAN: Interpretable GAN for Smart Editing

Abstract:In this paper, we introduce a tunable generative adversary network (TunaGAN) that uses an auxiliary network on top of existing generator networks (Style-GAN) to modify high-resolution face images according to user's high-level instructions, with good qualitative and quantitative performance. To optimize for feature disentanglement, we also investigate two different latent space that could be traversed for modification. The problem of mode collapse is characterized in detail for model robustness. This work could be easily extended to content-aware image editor based on other GANs and provide insight on mode collapse problems in more general settings.

Via

Access Paper or Ask Questions