Abstract: Motivated by the superior performance of image diffusion models, a growing number of researchers are extending these models to text-based video editing. Nevertheless, current video editing methods face a dilemma between high fine-tuning cost and limited generation capacity. We conjecture that, compared with images, videos require additional constraints to preserve temporal consistency during editing. To this end, we propose EVE, a robust and efficient zero-shot video editing method. Guided by depth maps and temporal consistency constraints, EVE produces satisfactory video editing results at an affordable computational and time cost. Moreover, recognizing the absence of a publicly available video editing dataset for fair comparison, we construct a new benchmark dataset, ZVE-50. Through comprehensive experiments, we validate that EVE achieves a satisfactory trade-off between performance and efficiency. We will release our dataset and codebase to facilitate future research.
Abstract: In this paper, we present CelebHair, a new large-scale dataset for hairstyle recommendation based on the celebrity facial attributes dataset CelebA. Our dataset inherits the majority of facial images, along with some beauty-related facial attributes, from CelebA. Additionally, we employ facial landmark detection techniques to extract extra features such as nose length and pupillary distance, and deep convolutional neural networks for face shape and hairstyle classification. Empirical comparison demonstrates the superiority of our dataset over existing hairstyle-related datasets in variety, veracity, and volume. We conduct analysis and experiments on the dataset to evaluate its robustness and usability.
Abstract: Railway systems require regular manual maintenance, a large part of which is dedicated to inspecting track deformation. Such deformation can severely compromise the operational safety of trains, yet inspections remain costly in both money and manpower. A more precise, efficient, and automated approach to detecting potential railway track deformation is therefore urgently needed. In this paper, we propose an application framework for predicting vertical track irregularities. Our research is based on large-scale real-world datasets produced by several operating railways in China. We explore several sampling methods and compare traditional machine learning algorithms for time-series prediction with popular deep learning techniques. Different ensemble learning methods are also employed for further optimization. We conclude that neural networks are the most performant and accurate.