Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jiangyue Yu

Deep Image Style Transfer from Freeform Text

Dec 13, 2022

Tejas Santanam, Mengyang Liu, Jiangyue Yu, Zhaodong Yang

Figure 1 for Deep Image Style Transfer from Freeform Text

Figure 2 for Deep Image Style Transfer from Freeform Text

Figure 3 for Deep Image Style Transfer from Freeform Text

Figure 4 for Deep Image Style Transfer from Freeform Text

Abstract:This paper creates a novel method of deep neural style transfer by generating style images from freeform user text input. The language model and style transfer model form a seamless pipeline that can create output images with similar losses and improved quality when compared to baseline style transfer methods. The language model returns a closely matching image given a style text and description input, which is then passed to the style transfer model with an input content image to create a final output. A proof-of-concept tool is also developed to integrate the models and demonstrate the effectiveness of deep image style transfer from freeform text.

Via

Access Paper or Ask Questions

Motif Mining and Unsupervised Representation Learning for BirdCLEF 2022

Jun 08, 2022

Anthony Miyaguchi, Jiangyue Yu, Bryan Cheungvivatpant, Dakota Dudley, Aniketh Swain

Figure 1 for Motif Mining and Unsupervised Representation Learning for BirdCLEF 2022

Figure 2 for Motif Mining and Unsupervised Representation Learning for BirdCLEF 2022

Figure 3 for Motif Mining and Unsupervised Representation Learning for BirdCLEF 2022

Figure 4 for Motif Mining and Unsupervised Representation Learning for BirdCLEF 2022

Abstract:We build a classification model for the BirdCLEF 2022 challenge using unsupervised methods. We implement an unsupervised representation of the training dataset using a triplet loss on spectrogram representation of audio motifs. Our best model performs with a score of 0.48 on the public leaderboard.

* Submitted to CEUR-WS under LifeCLEF for the BirdCLEF 2022 challenge as a working note

Via

Access Paper or Ask Questions