Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Takuya Furusawa

Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models

Dec 24, 2024

Qice Qin, Yuki Hirakawa, Ryotaro Shimizu, Takuya Furusawa, Edgar Simo-Serra

Figure 1 for Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models

Figure 2 for Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models

Figure 3 for Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models

Figure 4 for Fashionability-Enhancing Outfit Image Editing with Conditional Diffusion Models

Abstract:Image generation in the fashion domain has predominantly focused on preserving body characteristics or following input prompts, but little attention has been paid to improving the inherent fashionability of the output images. This paper presents a novel diffusion model-based approach that generates fashion images with improved fashionability while maintaining control over key attributes. Key components of our method include: 1) fashionability enhancement, which ensures that the generated images are more fashionable than the input; 2) preservation of body characteristics, encouraging the generated images to maintain the original shape and proportions of the input; and 3) automatic fashion optimization, which does not rely on manual input or external prompts. We also employ two methods to collect training data for guidance while generating and evaluating the images. In particular, we rate outfit images using fashionability scores annotated by multiple fashion experts through OpenSkill-based and five critical aspect-based pairwise comparisons. These methods provide complementary perspectives for assessing and improving the fashionability of the generated images. The experimental results show that our approach outperforms the baseline Fashion++ in generating images with superior fashionability, demonstrating its effectiveness in producing more stylish and appealing fashion images.

* 11 pages, 6 figures

Via

Access Paper or Ask Questions

An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation

Oct 31, 2024

Yuki Hirakawa, Takashi Wada, Kazuya Morishita, Ryotaro Shimizu, Takuya Furusawa, Sai Htaung Kham, Yuki Saito

Figure 1 for An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation

Figure 2 for An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation

Figure 3 for An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation

Figure 4 for An Empirical Analysis of GPT-4V's Performance on Fashion Aesthetic Evaluation

Abstract:Fashion aesthetic evaluation is the task of estimating how well the outfits worn by individuals in images suit them. In this work, we examine the zero-shot performance of GPT-4V on this task for the first time. We show that its predictions align fairly well with human judgments on our datasets, and also find that it struggles with ranking outfits in similar colors. The code is available at https://github.com/st-tech/gpt4v-fashion-aesthetic-evaluation.

Via

Access Paper or Ask Questions

Mean Field Theory in Deep Metric Learning

Jun 27, 2023

Takuya Furusawa

Figure 1 for Mean Field Theory in Deep Metric Learning

Figure 2 for Mean Field Theory in Deep Metric Learning

Figure 3 for Mean Field Theory in Deep Metric Learning

Figure 4 for Mean Field Theory in Deep Metric Learning

Abstract:In this paper, we explore the application of mean field theory, a technique from statistical physics, to deep metric learning and address the high training complexity commonly associated with conventional metric learning loss functions. By adapting mean field theory for deep metric learning, we develop an approach to design classification-based loss functions from pair-based ones, which can be considered complementary to the proxy-based approach. Applying the mean field theory to two pair-based loss functions, we derive two new loss functions, MeanFieldContrastive and MeanFieldClassWiseMultiSimilarity losses, with reduced training complexity. We extensively evaluate these derived loss functions on three image-retrieval datasets and demonstrate that our loss functions outperform baseline methods in two out of the three datasets.

* 15 pages

Via

Access Paper or Ask Questions