João Neves

Face Super-Resolution Using Stochastic Differential Equations

Sep 24, 2022

Marcelo dos Santos, Rayson Laroca, Rafael O. Ribeiro, João Neves, Hugo Proença, David Menotti

Abstract: Diffusion models have proven effective for various applications such as image, audio, and graph generation. Other important applications are image super-resolution and the solution of inverse problems. More recently, some works have used stochastic differential equations (SDEs) to generalize diffusion models to continuous time. In this work, we introduce SDEs to generate super-resolution face images. To the best of our knowledge, this is the first time SDEs have been used for such an application. The proposed method achieves better peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and consistency than existing super-resolution methods based on diffusion models. We also assess the potential application of this method to the face recognition task: a generic facial feature extractor is used to compare the super-resolution images with the ground truth, and superior results were obtained compared with other methods. Our code is publicly available at https://github.com/marcelowds/sr-sde

* Accepted for presentation at the Conference on Graphics, Patterns and Images (SIBGRAPI) 2022
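
For readers unfamiliar with the PSNR metric named in the abstract, the following is a minimal sketch of its standard definition, not code from the authors' repository. The function name `psnr`, the flat-list image representation, and the default peak value of 255 (8-bit images) are assumptions made for illustration.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two images.

    Both images are given as flat sequences of pixel intensities of equal
    length (an illustrative simplification; real images are 2-D or 3-D arrays).
    """
    if len(reference) != len(estimate) or len(reference) == 0:
        raise ValueError("images must be non-empty and have the same size")
    # Mean squared error over all pixels.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


# Hypothetical 4-pixel "images": every pixel is off by 10 intensity levels.
ground_truth = [100.0, 100.0, 100.0, 100.0]
super_resolved = [110.0, 90.0, 110.0, 90.0]
print(f"PSNR: {psnr(ground_truth, super_resolved):.2f} dB")
```

Higher values indicate a reconstruction closer to the ground truth; SSIM and the consistency measure mentioned in the abstract are separate metrics and are not covered by this sketch.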

Generative Adversarial Graph Convolutional Networks for Human Action Synthesis

Oct 25, 2021

Bruno Degardin, João Neves, Vasco Lopes, João Brito, Ehsan Yaghoubi, Hugo Proença

Abstract: Synthesising the spatial and temporal dynamics of the human body skeleton remains a challenging task, not only in terms of the quality of the generated shapes, but also of their diversity, particularly when synthesising realistic body movements of a specific action (action conditioning). In this paper, we propose Kinetic-GAN, a novel architecture that leverages the benefits of Generative Adversarial Networks and Graph Convolutional Networks to synthesise the kinetics of the human body. The proposed adversarial architecture can be conditioned on up to 120 different actions over local and global body movements while improving sample quality and diversity through latent space disentanglement and stochastic variations. Our experiments were carried out on three well-known datasets, where Kinetic-GAN notably surpasses the state-of-the-art methods in terms of distribution quality metrics while being able to synthesise more than an order of magnitude more distinct actions. Our code and models are publicly available at https://github.com/DegardinBruno/Kinetic-GAN.

* Published as a conference paper at WACV 2022. Code and pretrained models available at https://github.com/DegardinBruno/Kinetic-GAN

ZSpeedL -- Evaluating the Performance of Zero-Shot Learning Methods using Low-Power Devices

Oct 09, 2021

Cristiano Patrício, João Neves

Abstract: The recognition of unseen objects from a semantic representation or textual description, usually denoted as zero-shot learning (ZSL), is better suited to real-world scenarios than traditional object recognition. Nevertheless, no work has evaluated the feasibility of deploying zero-shot learning approaches in these scenarios, particularly when using low-power devices. In this paper, we provide the first benchmark on the inference time of zero-shot learning, comprising an evaluation of state-of-the-art approaches regarding their speed/accuracy trade-off. An analysis of the processing time of the different phases of the ZSL inference stage reveals that visual feature extraction is the major bottleneck in this paradigm, but we show that lightweight networks can dramatically reduce the overall inference time without reducing the accuracy obtained by the de facto ResNet101 architecture. This benchmark also evaluates how different ZSL approaches perform on low-power devices, and how the visual feature extraction phase could be optimized on this hardware. To foster the research and deployment of ZSL systems capable of operating in real-world scenarios, we release the evaluation framework used in this benchmark (https://github.com/CristianoPatricio/zsl-methods).

* 8 pages. Accepted at the 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), 2021
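
The per-phase timing analysis described in the abstract can be illustrated with a small sketch. This is not the released evaluation framework: the helper `time_phases` and the three stand-in phases below are hypothetical placeholders for feature extraction, semantic projection, and class assignment.

```python
import time


def time_phases(phases, sample):
    """Run (name, callable) phases in order and record seconds spent in each.

    The output of one phase is passed as input to the next, mimicking an
    inference pipeline. Returns the final output and a dict of timings.
    """
    timings = {}
    value = sample
    for name, phase in phases:
        start = time.perf_counter()
        value = phase(value)
        timings[name] = time.perf_counter() - start
    return value, timings


# Stand-in phases (assumptions for illustration only, not real ZSL models).
pipeline = [
    ("feature_extraction", lambda pixels: [p / 255.0 for p in pixels]),
    ("semantic_projection", lambda feats: sum(feats) / len(feats)),
    ("class_assignment", lambda score: "seen" if score > 0.5 else "unseen"),
]

label, timings = time_phases(pipeline, [10, 20, 30, 40])
slowest = max(timings, key=timings.get)
print(label, "slowest phase:", slowest)
```

Timing each phase separately, rather than the pipeline as a whole, is what makes it possible to identify a single phase as the bottleneck.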

Development of the first Portuguese radar tracking sensor for Space Debris

Feb 20, 2021

João Pandeirada, Miguel Bergano, João Neves, Paulo Marques, Domingos Barbosa, Bruno Coelho, Valério Ribeiro

Abstract: Currently, space debris represents a threat to satellites and space-based operations, both in orbit and during the launching process. The yearly increase in space debris is a serious concern for major space agencies, leading to the development of dedicated space programs to deal with this issue. Ground-based radars can detect Earth-orbiting debris down to a few square centimeters and therefore constitute a major building block of a space debris monitoring system. New radar sensors are required in Europe to enhance the capabilities and availability of its small radar network capable of tracking and surveying space objects, and to respond to the debris increase expected from New Space economy activities. This article presents ATLAS, a new tracking radar system for debris detection located in Portugal. It starts with an extensive technical description of all the system components, followed by a study that estimates its future performance. A section dedicated to waveform design is also presented, since the system allows the use of several types of pulse modulation schemes, such as LFM and phase-coded modulations, while enabling the development and testing of more advanced ones. By presenting an architecture that is highly modular with fully digital signal processing, ATLAS establishes a platform for fast and easy development, research and innovation. The system relies on Commercial-Off-The-Shelf technologies and Open Systems, which is unique among current radar systems.

* Reviewed; Accepted for Publication at Signals, MDPI, ISSN 2624-6120, February 2021; 16 pages, 8 Figures
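
As an illustration of the LFM (linear frequency modulation) pulses mentioned in the abstract, here is a minimal sketch of a generic baseband chirp. It does not reflect the ATLAS waveform design: the function names and the bandwidth, duration, and sample-rate values are assumptions chosen only to keep the example small.

```python
import cmath
import math


def lfm_pulse(bandwidth_hz, duration_s, sample_rate_hz):
    """Complex baseband LFM pulse sweeping from -B/2 to +B/2 over the pulse."""
    n = int(round(duration_s * sample_rate_hz))
    chirp_rate = bandwidth_hz / duration_s  # Hz per second
    samples = []
    for k in range(n):
        t = k / sample_rate_hz - duration_s / 2.0  # time centred on the pulse
        samples.append(cmath.exp(1j * math.pi * chirp_rate * t * t))
    return samples


def instantaneous_frequency(samples, sample_rate_hz):
    """Estimate frequency (Hz) from the phase step between adjacent samples."""
    return [
        cmath.phase(b * a.conjugate()) * sample_rate_hz / (2.0 * math.pi)
        for a, b in zip(samples, samples[1:])
    ]


# Hypothetical parameters: 1 MHz sweep, 100 microsecond pulse, 4 MHz sampling.
pulse = lfm_pulse(1e6, 1e-4, 4e6)
freqs = instantaneous_frequency(pulse, 4e6)
print(f"{len(pulse)} samples, sweep {freqs[0]:.0f} Hz to {freqs[-1]:.0f} Hz")
```

The estimated frequency rises linearly across the pulse, which is the defining property of an LFM waveform; the sample rate must exceed the swept bandwidth for the phase-difference estimate to be unambiguous.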

An Attention-Based Deep Learning Model for Multiple Pedestrian Attributes Recognition

Apr 02, 2020

Ehsan Yaghoubi, Diana Borza, João Neves, Aruna Kumar, Hugo Proença

Abstract: The automatic characterization of pedestrians in surveillance footage is a tough challenge, particularly when the data is extremely diverse, with cluttered backgrounds, and subjects are captured from varying distances, under multiple poses, and with partial occlusion. Having observed that the state-of-the-art performance is still unsatisfactory, this paper provides a novel solution to the problem, with two-fold contributions: 1) considering the strong semantic correlation between the different full-body attributes, we propose a multi-task deep model that uses an element-wise multiplication layer to extract more comprehensive feature representations. In practice, this layer serves as a filter to remove irrelevant background features, and is particularly important for handling complex, cluttered data; and 2) we introduce a weighted-sum term to the loss function that not only weights the contribution of each task (kind of attribute) but is also crucial for performance improvement in multiple-attribute inference settings. Our experiments were performed on two well-known datasets (RAP and PETA) and point to the superiority of the proposed method with respect to the state-of-the-art. The code is available at https://github.com/Ehsan-Yaghoubi/MAN-PAR-.

* Submitted to Image and Vision Computing journal
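
The weighted-sum loss term mentioned in the abstract can be sketched generically as a weighted combination of per-task losses. The abstract does not state how the weights are chosen, so the task names, weights, and helper functions below are hypothetical and this is not the authors' implementation.

```python
import math


def cross_entropy(probabilities, target_index):
    """Cross-entropy of one prediction: negative log-probability of the target."""
    return -math.log(probabilities[target_index])


def weighted_multitask_loss(task_losses, task_weights):
    """Combine per-task losses into one scalar as sum_i w_i * L_i."""
    if len(task_losses) != len(task_weights):
        raise ValueError("need exactly one weight per task")
    return sum(w * loss for w, loss in zip(task_weights, task_losses))


# Hypothetical attribute tasks with made-up predicted class probabilities.
losses = [
    cross_entropy([0.7, 0.3], 0),       # e.g. a binary attribute
    cross_entropy([0.2, 0.5, 0.3], 1),  # e.g. a three-class attribute
]
weights = [0.4, 0.6]  # assumed weights, one per task
total = weighted_multitask_loss(losses, weights)
print(f"total loss: {total:.4f}")
```

Setting a task's weight higher makes errors on that attribute count more in the total, which is one way a single shared model can balance several attribute-recognition tasks.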
