Thales Vieira

A new visual quality metric for Evaluating the performance of multidimensional projections

Jul 23, 2024

Maniru Ibrahim, Thales Vieira

Abstract: Multidimensional projections (MP) are among the most essential approaches in the visual analysis of multidimensional data. They transform multidimensional data into two-dimensional representations that may be shown as scatter plots while preserving their similarity with the original data. Human visual perception is frequently used to evaluate the quality of MP. In this work, we study and improve on a well-known map called Local Affine Multidimensional Projection (LAMP), which takes a multidimensional instance and embeds it in Cartesian space via moving least squares deformation. We propose a new visual quality metric based on human perception. The new metric combines three previously used metrics: silhouette coefficient, neighborhood preservation, and silhouette ratio. We show that the proposed metric produces more precise results in analyzing the quality of MP than other previously used metrics. Finally, we describe an algorithm that attempts to overcome a limitation of the LAMP method, which requires a similar scale for control points and their counterparts in the Cartesian space.

* 19 pages, 10 figures
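
Of the three component metrics named in the abstract, neighborhood preservation is the simplest to illustrate. The sketch below assumes the commonly used definition (the average fraction of each point's k nearest neighbors in the original space that remain among its k nearest neighbors after projection). It does not reproduce the paper's combined metric, whose formula is not given here, and the function names are hypothetical.

```python
import math

def knn_indices(points, i, k):
    """Indices of the k nearest neighbors of points[i] (Euclidean, excluding i)."""
    dists = [(math.dist(points[i], p), j) for j, p in enumerate(points) if j != i]
    dists.sort()
    return {j for _, j in dists[:k]}

def neighborhood_preservation(high_dim, projected, k):
    """Mean fraction of each point's k original-space neighbors kept in the projection."""
    n = len(high_dim)
    total = 0.0
    for i in range(n):
        original = knn_indices(high_dim, i, k)
        embedded = knn_indices(projected, i, k)
        total += len(original & embedded) / k
    return total / n

# Toy data (made up for illustration): four 3D points lying on a line.
high = [(0, 0, 0), (1, 0, 0), (3, 0, 0), (10, 0, 0)]
faithful = [(0, 0), (1, 0), (3, 0), (10, 0)]    # keeps every neighborhood
scrambled = [(0, 0), (10, 0), (3, 0), (1, 0)]   # breaks the neighborhoods

score_faithful = neighborhood_preservation(high, faithful, k=1)
score_scrambled = neighborhood_preservation(high, scrambled, k=1)
```

A score of 1 means every neighborhood survived the projection; lower values indicate that nearby points in the original space were separated.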

Recognizing Handwritten Mathematical Expressions of Vertical Addition and Subtraction

Aug 10, 2023

Daniel Rosa, Filipe R. Cordeiro, Ruan Carvalho, Everton Souza, Sergio Chevtchenko, Luiz Rodrigues, Marcelo Marinho, Thales Vieira, Valmir Macario

Abstract: Handwritten Mathematical Expression Recognition (HMER) is a challenging task with many educational applications. Recent methods for HMER have been developed for complex mathematical expressions in standard horizontal format. However, solutions for elementary mathematical expressions, such as vertical addition and subtraction, have not been explored in the literature. This work proposes a new handwritten elementary mathematical expression dataset composed of addition and subtraction expressions in a vertical format. We also extend the MNIST dataset to generate artificial images with this structure. Furthermore, we propose a solution for offline HMER that is able to recognize vertical addition and subtraction expressions. Our analysis evaluates the object detection algorithms YOLO v7, YOLO v8, YOLO-NAS, NanoDet and FCOS for identifying the mathematical symbols. We also propose a transcription method that maps the bounding boxes from the object detection stage to a mathematical expression as a LaTeX markup sequence. Results show that our approach is efficient, achieving a high expression recognition rate. The code and dataset are available at https://github.com/Danielgol/HME-VAS

* Paper accepted at SIBGRAPI 2023
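
The abstract mentions a transcription step from detected bounding boxes to LaTeX. The following is only a minimal sketch of how such a step could work, not the authors' method: it assumes each detection is a hypothetical `(label, x_center, y_center)` tuple, groups detections into rows by vertical position, orders each row left to right, and treats a hypothetical `"line"` label as the horizontal bar.

```python
def group_rows(detections, row_tolerance):
    """Cluster detections into rows; a new row starts when y jumps by more than row_tolerance."""
    rows = []
    for det in sorted(detections, key=lambda d: d[2]):
        if rows and abs(det[2] - rows[-1][-1][2]) <= row_tolerance:
            rows[-1].append(det)
        else:
            rows.append([det])
    return rows

def to_latex(detections, row_tolerance=10.0):
    """Emit a right-aligned LaTeX array with one line per detected row."""
    lines = []
    for row in group_rows(detections, row_tolerance):
        symbols = [label for label, _, _ in sorted(row, key=lambda d: d[1])]
        if symbols == ["line"]:          # hypothetical label for the horizontal bar
            lines.append("\\hline")
        else:
            lines.append("".join(symbols) + " \\\\")
    return "\\begin{array}{r}\n" + "\n".join(lines) + "\n\\end{array}"

# Made-up detections for the vertical sum 12 + 34 = 46, deliberately unordered.
detections = [
    ("4", 30, 30), ("1", 20, 10), ("line", 20, 45), ("+", 5, 30),
    ("6", 30, 60), ("2", 30, 10), ("3", 20, 30), ("4", 20, 60),
]
latex = to_latex(detections)
print(latex)
```

A fixed row tolerance is the simplest possible grouping rule; real handwriting would likely need a tolerance derived from the bounding-box heights.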

User-oriented Natural Human-Robot Control with Thin-Plate Splines and LRCN

May 24, 2021

Bruno Lima, Lucas Amaral, Givanildo Nascimento-Jr, Victor Mafra, Bruno Georgevich Ferreira, Tiago Vieira, Thales Vieira

Abstract: We propose a real-time vision-based teleoperation approach for robotic arms that employs a single depth-based camera, exempting the user from the need for any wearable devices. By employing a natural user interface, this novel approach leverages the conventional fine-tuning control, turning it into a direct body pose capture process. The proposed approach comprises two main parts. The first is a nonlinear customizable pose mapping based on Thin-Plate Splines (TPS), which directly transfers human body motion to robotic arm motion in a nonlinear fashion, thus allowing dissimilar bodies with different workspace shapes and kinematic constraints to be matched. The second is a Deep Neural Network hand-state classifier based on Long-term Recurrent Convolutional Networks (LRCN) that exploits the temporal coherence of the acquired depth data. We validate, evaluate and compare our approach through both classical cross-validation experiments of the proposed hand-state classifier and user studies over a set of practical experiments involving variants of pick-and-place and manufacturing tasks. Results revealed that LRCN networks outperform single-image Convolutional Neural Networks, and that users' learning curves were steep, thus allowing the successful completion of the proposed tasks. When compared to a previous approach, the TPS approach revealed no increase in task complexity and similar times of completion, while providing more precise operation in regions closer to workspace boundaries.

* 15 pages, 9 figures, demo video available at https://youtu.be/Rk3iS_KnaWc
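
To make the idea of a TPS-based mapping concrete, here is a minimal sketch of a thin-plate spline warp fitted to control-point pairs. It is an illustration under assumptions, not the paper's implementation: it works in 2D with the classical kernel U(r) = r^2 log r, whereas the dimensionality and kernel used for pose mapping in the paper may differ, and all names are hypothetical.

```python
import math

def tps_kernel(r):
    """Classical 2D thin-plate spline radial basis U(r) = r^2 log r, with U(0) = 0."""
    return 0.0 if r == 0.0 else r * r * math.log(r)

def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(a)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x

def fit_tps(sources, targets):
    """Return a warp that sends each 2D source control point to its target."""
    n = len(sources)
    size = n + 3
    system = [[0.0] * size for _ in range(size)]
    for i, (xi, yi) in enumerate(sources):
        for j, pj in enumerate(sources):
            system[i][j] = tps_kernel(math.dist((xi, yi), pj))
        for col, value in enumerate((1.0, xi, yi)):
            system[i][n + col] = value   # affine block
            system[n + col][i] = value   # its transpose (side conditions)
    # One set of coefficients per output coordinate.
    coeffs = [solve(system, [t[d] for t in targets] + [0.0] * 3) for d in range(2)]

    def warp(point):
        out = []
        for c in coeffs:
            value = c[n] + c[n + 1] * point[0] + c[n + 2] * point[1]
            value += sum(w * tps_kernel(math.dist(point, s)) for w, s in zip(c, sources))
            out.append(value)
        return tuple(out)

    return warp

# Made-up control points: a unit square whose center is pulled off-center.
sources = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (0.5, 0.5)]
targets = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (0.6, 0.7)]
warp = fit_tps(sources, targets)
```

The warp interpolates the control points exactly and reduces to a plain affine map when the targets are an affine image of the sources, which is what makes control-point pairs a convenient way to customize a nonlinear mapping.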

LRCN-RetailNet: A recurrent neural network architecture for accurate people counting

May 12, 2020

Lucas Massa, Adriano Barbosa, Krerley Oliveira, Thales Vieira

Abstract: Measuring and analyzing the flow of customers in retail stores is essential for a retailer to better comprehend customers' behavior and support decision-making. Nevertheless, not much attention has been given to the development of novel technologies for automatic people counting. We introduce LRCN-RetailNet: a recurrent neural network architecture capable of learning a non-linear regression model and accurately predicting the people count from videos captured by low-cost surveillance cameras. The input video format follows the recently proposed RGBP image format, which comprises color and people (foreground) information. Our architecture is capable of considering two relevant aspects: spatial features extracted through convolutional layers from the RGBP images, and the temporal coherence of the problem, which is exploited by recurrent layers. We show that, through a supervised learning approach, the trained models are capable of predicting the people count with high accuracy. Additionally, we demonstrate that a straightforward modification of the methodology is effective in excluding salespeople from the people count. Comprehensive experiments were conducted to validate, evaluate and compare the proposed architecture. Results corroborated that LRCN-RetailNet remarkably outperforms both the previous RetailNet architecture, which was limited to evaluating a single image per iteration, and a state-of-the-art neural network for object detection. Finally, computational performance experiments confirmed that the entire methodology is effective for estimating the people count in real time.
