Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yongming Zhang

TexControl: Sketch-Based Two-Stage Fashion Image Generation Using Diffusion Model

May 07, 2024

Yongming Zhang, Tianyu Zhang, Haoran Xie

Abstract:Deep learning-based sketch-to-clothing image generation provides the initial designs and inspiration in the fashion design processes. However, clothing generation from freehand drawing is challenging due to the sparse and ambiguous information from the drawn sketches. The current generation models may have difficulty generating detailed texture information. In this work, we propose TexControl, a sketch-based fashion generation framework that uses a two-stage pipeline to generate the fashion image corresponding to the sketch input. First, we adopt ControlNet to generate the fashion image from sketch and keep the image outline stable. Then, we use an image-to-image method to optimize the detailed textures of the generated images and obtain the final results. The evaluation results show that TexControl can generate fashion images with high-quality texture as fine-grained image generation.

* 5 pages, 8 figures, accepted in NICOGRAPH International 2024

Via

Access Paper or Ask Questions

Video Smoke Detection Based on Deep Saliency Network

Sep 08, 2018

Gao Xu, Yongming Zhang, Qixing Zhang, Gaohua Lin, Yang Jia, Jinjun Wang

Figure 1 for Video Smoke Detection Based on Deep Saliency Network

Figure 2 for Video Smoke Detection Based on Deep Saliency Network

Figure 3 for Video Smoke Detection Based on Deep Saliency Network

Figure 4 for Video Smoke Detection Based on Deep Saliency Network

Abstract:Video smoke detection is a promising fire detection method especially in open or large spaces and outdoor environments. Traditional smoke detection consists of candidate region extraction and classification, but it lacks powerful characterization for smoke. In this paper, we propose a novel method for video smoke detection based on deep saliency network. Visual saliency detection aims to highlight the most important object regions in an image. The pixel-level and object-level salient CNNs are combined to extract the informative smoke saliency map. For the need of application for smoke event detection, an end-to-end framework for salient smoke detection and existence prediction of smoke is proposed. The deep feature map is combined with the saliency map to predict the existence of smoke in image. Initial dataset and augmented dataset are built to measure the performance of frameworks with different design strategies. Qualitative and quantitative analysis at frame-level and pixel-level demonstrates the excellent performance of the ultimate framework.

* 20 pages, 13 figures

Via

Access Paper or Ask Questions

Domain Adaptation from Synthesis to Reality in Single-model Detector for Video Smoke Detection

May 18, 2018

Gao Xu, Yongming Zhang, Qixing Zhang, Gaohua Lin, Jinjun Wang

Figure 1 for Domain Adaptation from Synthesis to Reality in Single-model Detector for Video Smoke Detection

Figure 2 for Domain Adaptation from Synthesis to Reality in Single-model Detector for Video Smoke Detection

Figure 3 for Domain Adaptation from Synthesis to Reality in Single-model Detector for Video Smoke Detection

Figure 4 for Domain Adaptation from Synthesis to Reality in Single-model Detector for Video Smoke Detection

Abstract:This paper proposes a method for video smoke detection using synthetic smoke samples. The virtual data can automatically offer precise and rich annotated samples. However, the learning of smoke representations will be hurt by the appearance gap between real and synthetic smoke samples. The existed researches mainly work on the adaptation to samples extracted from original annotated samples. These methods take the object detection and domain adaptation as two independent parts. To train a strong detector with rich synthetic samples, we construct the adaptation to the detection layer of state-of-the-art single-model detectors (SSD and MS-CNN). The training procedure is an end-to-end stage. The classification, location and adaptation are combined in the learning. The performance of the proposed model surpasses the original baseline in our experiments. Meanwhile, our results show that the detectors based on the adversarial adaptation are superior to the detectors based on the discrepancy adaptation. Code will be made publicly available on http://smoke.ustc.edu.cn. Moreover, the domain adaptation for two-stage detector is described in Appendix A.

* The manuscript approved by all authors is our original work, and has submitted to Pattern Recognition for peer review previously. There are 4532 words, 6 figures and 1 table in this manuscript

Via

Access Paper or Ask Questions

Deep Domain Adaptation Based Video Smoke Detection using Synthetic Smoke Images

Mar 31, 2017

Gao Xu, Yongming Zhang, Qixing Zhang, Gaohua Lin, Jinjun Wang

Figure 1 for Deep Domain Adaptation Based Video Smoke Detection using Synthetic Smoke Images

Figure 2 for Deep Domain Adaptation Based Video Smoke Detection using Synthetic Smoke Images

Figure 3 for Deep Domain Adaptation Based Video Smoke Detection using Synthetic Smoke Images

Figure 4 for Deep Domain Adaptation Based Video Smoke Detection using Synthetic Smoke Images

Abstract:In this paper, a deep domain adaptation based method for video smoke detection is proposed to extract a powerful feature representation of smoke. Due to the smoke image samples limited in scale and diversity for deep CNN training, we systematically produced adequate synthetic smoke images with a wide variation in the smoke shape, background and lighting conditions. Considering that the appearance gap (dataset bias) between synthetic and real smoke images degrades significantly the performance of the trained model on the test set composed fully of real images, we build deep architectures based on domain adaptation to confuse the distributions of features extracted from synthetic and real smoke images. This approach expands the domain-invariant feature space for smoke image samples. With their approximate feature distribution off non-smoke images, the recognition rate of the trained model is improved significantly compared to the model trained directly on mixed dataset of synthetic and real images. Experimentally, several deep architectures with different design choices are applied to the smoke detector. The ultimate framework can get a satisfactory result on the test set. We believe that our approach is a start in the direction of utilizing deep neural networks enhanced with synthetic smoke images for video smoke detection.

* Fire Safety Journal 93C (2017) pp. 53-59
* The manuscript approved by all authors is our original work, and has submitted to Fire Safety Journal for peer review previously. There are 4516 words, 8 figures and 2 tables in this manuscript

Via

Access Paper or Ask Questions